MEMBERS OF THE FC RECEPTOR HOMOLOG GENE FAMILY
(FCRH1-3, 6), RELATED REAGENTS, AND USES THEREOF

This application claims the benefit of U.S. Provisional Application No.
60/367,667, filed March 25, 2002.

FIELD OF THE INVENTION
This invention relates generally to immunology and modulation of immunologic 
responses in the context of inflammatory diseases and cancer. 



BACKGROUND OF THE INVENTION 
Receptors for the Fc region (FcRs) of Igs have broad tissue distribution patterns 
and can modulate cellular and humoral immunity by linking their antibody ligands with 
effector cells of the immune system (Ravetch, J. V. & Kinet, J.-P. (1991) Annu. Rev. 
Immunol. 9, 457-492; Daeron, M. (1997) Annu. Rev. Immunol. 15, 203-234). These
cellular receptors have the ability to sense humoral concentrations of antibody, initiate 
cellular responses in host defense, and participate in autoimmune disorders (Ravetch, J. 
V. & Bolland, S. (2001) Annu. Rev. Immunol. 19, 275-290). Their diverse regulatory 
roles depend on the Ig isotype specificity and cellular distribution of the individual FcR. 
These Ig superfamily members share similarities in their ligand binding subunits, and 
they may have inhibitory or activating signaling motifs in their intracellular domains or 
instead pair with signal transducing subunits possessing activating signaling motifs. 

Recently, FcR homologs in mice, the paired Ig-like receptors (Kubagawa, H. et al.
(1997) Proc. Natl. Acad. Sci. USA 94, 5261-5266; Hayami, K. et al. (1997) J. Biol.
Chem. 272, 7320-7327), and their relatives in humans, the Ig-like
transcripts/leucocyte Ig-like receptors (Borges, L. et al. (1997) J. Immunol. 159,
5192-5196; Samaridis, J. & Colonna, M. (1997) Eur. J. Immunol. 27, 660-665), have
been characterized. This multigene family, which includes the FcαR (Kremer, E. J.
et al. (1992) Hum. Genet. 89, 107-108) and the natural killer cell Ig-like
receptors (Wagtmann, N. et al. (1997) Curr. Biol. 7, 615-618), is located in a human
chromosome 19q13 region known as the leucocyte receptor complex (LRC) (Wende, H.
et al. (1999) Mamm. Genome 10, 154-160; Wilson, M. J. et al. (2000) Proc. Natl. Acad.
Sci. USA 97, 4778-4783). These Ig-like multigene families belong to a larger class of
receptors characterized by their possession of common cytoplasmic tyrosine-based
signaling motifs. These can be either immunoreceptor tyrosine-based activation motifs
(ITAMs) containing two repeats of the consensus sequence Y-X-X-L/I spaced by 6-8
amino acids, (E/D)-X-X-Y-X-X-(L/I)-X(6-8)-Y-X-X-(L/I) (SEQ ID NO:64, with six amino
acid residues between the consensus sequences; SEQ ID NO:65, with seven amino acid
residues between the consensus sequences; and SEQ ID NO:66, with eight amino acid
residues between the consensus sequences), or immunoreceptor tyrosine-based inhibitory
motifs (ITIMs) with a 6-amino acid consensus sequence (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID
NO:67) (Reth, M. (1992) Annu. Rev. Immunol. 10, 97-121; Vely, F. & Vivier, E.
(1997) J. Immunol. 159, 2075-2077; Ravetch, J. V. & Lanier, L. L. (2000) Science 290,
84-89; Gergely, J. et al. (1999) Immunol. Lett. 68, 3-15). The phylogenetic conservation
of these types of receptors in birds (Dennis, G. et al. (2000) Proc. Natl. Acad. Sci. USA
97, 13245-13250) and bony fish (Yoder, J. A. et al. (2001) Proc. Natl. Acad. Sci. USA
98, 6771-6776) is indicative of their biological value. After ligand binding of the
activating receptor complexes, ITAM tyrosines are rapidly phosphorylated by Src
family kinases to initiate a cascade of signaling events that trigger cellular activation. In
the case of ITIM-bearing receptors, the tyrosines provide a docking site for
phosphatases containing Src homology 2 domains that can abrogate cellular activation
(Long, E. O. (1999) Annu. Rev. Immunol. 17, 875-904; Unkeless, J. C. & Jin, J. (1997)
Curr. Opin. Immunol. 9, 338-343). The balance in the utilization of these activating and
inhibitory receptor pairs can serve to modulate cellular responses to a variety of stimuli.
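
By way of illustration only, the ITAM and ITIM consensus sequences recited above can be written as regular expressions and used to scan a cytoplasmic amino acid sequence given in one-letter code. The following Python sketch is not part of the disclosed methods; the function name `find_motifs` and the example peptide are hypothetical and are not taken from the sequence listing.

```python
import re

# ITAM: (E/D)-X-X-Y-X-X-(L/I)-X(6-8)-Y-X-X-(L/I)
# ITIM: (I/V/L/S)-X-Y-X-X-(L/V)
# Each pattern is wrapped in a lookahead so that overlapping hits are reported.
ITAM = re.compile(r"(?=([ED]..Y..[LI].{6,8}Y..[LI]))")
ITIM = re.compile(r"(?=([IVLS].Y..[LV]))")


def find_motifs(seq):
    """Return 0-based start positions and matched text for each motif type."""
    seq = seq.upper()
    return {
        "ITAM": [(m.start(), m.group(1)) for m in ITAM.finditer(seq)],
        "ITIM": [(m.start(), m.group(1)) for m in ITIM.finditer(seq)],
    }


# Hypothetical peptide: one ITAM with a seven-residue spacer, then one ITIM.
example = "GG" + "EAAYAAL" + "A" * 7 + "YAAL" + "GG" + "VAYAAL" + "GG"
hits = find_motifs(example)
```

A match to the consensus indicates only a candidate motif; whether a given tyrosine is in fact phosphorylated or recruits a phosphatase is a matter for experiment.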

The genes encoding the classical FcγRs, FcγRI, FcγRII, FcγRIII, and FcεRI, lie
on the long arm of chromosome 1 (1q21-23) near the polymeric Ig receptor (pIgR) and
Fcα/μR genes (1q32) (20-23). Members of this FcR subfamily have relatively low
extracellular homology with the FcR-related genes that reside in the LRC on
chromosome 19. Like the FcγR- and FcεR-activating receptors, the ligand binding chain
of the FcαR coassociates with the ITAM-containing FcR common γ-chain
(Pfefferkorn, L. C. & Yeaman, G. R. (1994) J. Immunol. 153, 3228-3236; Morton, E.
C. et al. (1995) J. Biol. Chem. 270, 29781-29787). New members of the FcR family
were sought which could have diverse signaling properties and oncogenic potential.



SUMMARY OF THE INVENTION

In accordance with the purpose(s) of this invention, as embodied and broadly
described herein, this invention, in one aspect, relates to members of a cluster of FcR
and FcR gene relatives encoded, for example, by genes in the human chromosome
1q21-23 region, or analogous region in non-human subjects. The members are Type I
transmembrane receptors, or alternatively spliced forms thereof, with homology to the
FcR family and are referred to herein as FcRHs. Each FcRH can comprise an
extracellular region, a transmembrane region, and a cytoplasmic region. The
cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based
inhibitory or activation motifs ("ITIMs" or "ITAMs").

The invention relates to polypeptides corresponding to isolated FcRHs (e.g.,
huFcRH1, 2, 3, and 6 and moFcRH1, 2, and 3), as well as fragments and isoforms
thereof. The invention further relates to nucleic acids that encode the FcRHs, as well as
hybridization probes related thereto and complementary sequences. The invention
further provides vectors and cells related to the nucleic acids of the invention.

The invention further relates to making an FcRH, or a fragment or variant
thereof, comprising culturing a cell comprising a vector of the invention under
conditions permitting expression of the FcRH. The invention also provides an antibody
reagent kit comprising the antibody, or a fragment or variant thereof, and reagents for
detecting binding of the antibody, fragment, or antibody variant to a ligand.

The invention further relates to uses of the polypeptides, nucleic acids and
antibodies of the invention. For example, the invention relates to methods of
diagnosing and methods of treating a malignancy of hematopoietic cell lineage or an
inflammatory or autoimmune disease in a subject. The invention also relates to
modulation of a humoral immune response in a subject.

Additional advantages of the invention will be set forth in part in the description
which follows, and in part will be obvious from the description, or may be learned by
practice of the invention. The advantages of the invention will be realized and attained
by means of the elements and combinations particularly pointed out in the appended
claims. It is to be understood that both the foregoing general description and the
following detailed description are exemplary and explanatory only and are not
restrictive of the invention, as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of
this specification, illustrate several embodiments of the invention and, together
with the description, serve to explain the principles of the invention.

Figure 1 shows the relative position of the FcRH locus within the FcR cluster on
chromosome 1. The cytogenetic location of the FcR genes is approximated from the
GenBank Mapview database. The BAC clones (4, GenBank accession no. AL139409;
3, GenBank accession no. AL356276; 2, GenBank accession no. AL135929; and 1,
GenBank accession no. AL353721) that span the locus are oriented in relation to their
respective FcRH genes (shaded area).

Figure 2 shows the structural and sequence diversity of FcRH1, FcRH2, and
FcRH3. Figure 2A is a schematic representation of FcRH molecules. The three cDNAs
encode type I transmembrane proteins with similar extracellular domains, but different
cytoplasmic regions. The extracellular (EC) regions contain different numbers of C2-like
Ig domains and potential sites of N-linked glycosylation. The transmembrane (TM)
domains are uncharged, except in the case of FcRH1. The cytoplasmic (CY) region of
FcRH1 contains two ITAMs (light gray boxes) and one ITAM-like region (small, lined
box), whereas FcRH2 contains one ITAM and two ITIMs (dark gray boxes). FcRH3 has
a long cytoplasmic tail with one ITAM, one ITIM, and an ITAM-like region. The amino
acid length of each region is indicated. Figure 2B shows the multiple alignment
comparison of FcRH1, FcRH2, and FcRH3 amino acid sequences (one-letter code)
based on the FcRH3 sequence. Amino acid identity is represented by dots, and gaps are
indicated by dashes. Predicted N-linked glycosylation sites and transmembrane domains
are underlined in black. Consensus ITAM (bold) and ITIM (bold, underlined) motifs are
indicated. Putative structural domains are labeled: SP, signal peptide; EC, extracellular
domain; MP-TM, membrane proximal-transmembrane; and CY, cytoplasmic regions.
Amino acid lengths are indicated in parentheses.

Figure 3 shows a composite analysis of the extracellular homology among
FcRH and FcR family members. Pairwise analysis of individual Ig-like subunits was
performed with the CLUSTAL method algorithm using FcRH3 as the index of
comparison. Individual homologous domains are coded to indicate relatedness. Percent
amino acid identities for related domains are indicated and aligned in relation to the
comparative FcRH3 subunit. The amino acid identities for the membrane proximal
domains (light gray subunits) of FcRH5 are provided as the range of identity for all
individually related domains. Comparisons that are not applicable are left blank. Amino
acid sequences were derived from IRTA1 (GenBank accession no. AF343659), IRTA2
(GenBank accession no. AF34364), moFcRH (GenBank accession no. AAG28775),
FcγRI (GenBank accession no. AAA35678), FcγRII (Swiss-Prot accession no. P31994),
FcγRIII (Swiss-Prot accession no. P08637), FcεRI (Swiss-Prot accession no. P12319),
and FcαRI (Swiss-Prot accession no. P24071).

Figure 4 shows the relative location of the mouse FcR family. Location is
indicated in reference to the human FcR related genes at Ch 1q21-23 and their
orthologous loci on mouse Ch 3 and Ch 1. The microsatellite marker D3Mit187 is
located within moFcRH1.

Figure 5 shows the multiple alignment comparisons of huFcRH1-5 and mouse
FcRH1 and 2 amino acid sequences (one-letter code) based on the FcRH3 sequence.
Amino acid gaps are indicated by dashes. Consensus ITAM (underlined) and ITIM
(italic, underlined) motifs are indicated. Amino acid lengths are indicated in
parentheses.

Figure 6 shows domains marked to indicate relatedness of the Ig-like subunits.
Ig-like domain homology was determined by generation of a phylogenetic tree using
DNAStar software with the CLUSTAL program and assigning arbitrary colors to
individual Ig-domains of a given branch. Amino acid identities for full length,
extracellular, and cytoplasmic domain comparisons are based on huFcRH3. Closest
cytoplasmic relatives are indicated in parentheses. Most identical extracellular
comparisons between mouse and human relatives are highlighted in horizontal lines.
Comparisons that are not applicable are left blank.

Figure 7 shows the domains of huFcRH1-6, moFcRH1-3 and related proteins.
Domains are colored to indicate relatedness of the Ig-like subunits. Ig-like domain
homology was determined by generation of a phylogenetic tree using DNAStar software
with the CLUSTAL program and assigning arbitrary colors to individual Ig-domains of
a given branch. Amino acid identities for full length, extracellular, and cytoplasmic
domain comparisons are based on huFcRH3. Closest cytoplasmic relatives are indicated
in parentheses. Most identical extracellular comparisons between mouse and human
relatives are highlighted in red. Comparisons that are not applicable are left blank.

Figure 8 shows the structural characteristics of the mouse FcRH isoforms.

DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention may be understood more readily by reference to the
following detailed description of preferred embodiments of the invention and the
Examples included therein and to the Figures and their previous and following
description.

In this specification and in the claims that follow, reference will be made to a 
number of terms that shall be defined to have the following meanings: 

As used in the specification and the appended claims, the singular forms "a,"
"an" and "the" include plural referents unless the context clearly dictates otherwise.
Thus, for example, reference to "a receptor" includes mixtures of various receptors,
reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers,
and the like.

Ranges may be expressed herein as from "about" one particular value, and/or to
"about" another particular value. When such a range is expressed, another embodiment
includes from the one particular value and/or to the other particular value. Similarly,
when values are expressed as approximations, by use of the antecedent "about," it will
be understood that the particular value forms another embodiment. It will be further
understood that the endpoints of each of the ranges are significant both in relation to the
other endpoint, and independently of the other endpoint.

"Optional" or "optionally" means that the subsequently described event or
circumstance may or may not occur, and that the description includes instances where
said event or circumstance occurs and instances where it does not. For example, the
phrase "optionally contains two ITAM consensus motifs" means that the two ITAMs
may or may not be present and that the description includes both the presence and
absence of two ITAM consensus motifs.

As used throughout, by "subject" is meant an individual. Preferably, the subject is
a mammal such as a primate, and, more preferably, a human. The term "subject" can
include domesticated animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, pigs,
sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, guinea pig, etc.).

By "isolated nucleic acid" is meant a nucleic acid the structure of which is not
identical to that of the naturally occurring nucleic acid or to that of any fragment of the
naturally occurring genomic nucleic acid spanning more than three separate genes. The
term therefore covers, for example, (a) a DNA which has the sequence of part of the
naturally occurring genomic DNA molecules but is not flanked by both of the coding
sequences that flank that part of the molecule in the genome of the organism in which it
naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA
of a prokaryote or eukaryote in a manner such that the resulting molecule is not
identical to any naturally occurring vector or genomic DNA; (c) a separate molecule
such as cDNA, a genomic fragment, a fragment produced by polymerase chain reaction,
or a restriction fragment; and (d) a recombinant nucleotide sequence that is part of a
hybrid gene, i.e., a gene encoding a fusion protein.

By "label" is meant any detectable tag that can be attached directly (e.g., a
fluorescent molecule integrated into a polypeptide or nucleic acid) or indirectly (e.g., by
way of binding to a primary antibody a secondary antibody with an integrated fluorescent
molecule) to the molecule of interest. A "label" is any tag that can be visualized with
imaging methods. The detectable tag can be a radio-opaque substance, a radiolabel, a
fluorescent label, or a magnetic label. The detectable tag can be selected from the
group consisting of gamma-emitters, beta-emitters, alpha-emitters, positron-emitters,
X-ray-emitters and fluorescence-emitters suitable for localization. Suitable
fluorescent compounds include fluorescein sodium, fluorescein isothiocyanate,
phycoerythrin, and Texas Red sulfonyl chloride. See de Belder & Wik
(Preparation and properties of fluorescein-labelled hyaluronate. Carbohydr.
Res. 44(2):251-57 (1975)). Those skilled in the art will know, or will be able to ascertain
with no more than routine experimentation, other fluorescent compounds that are
suitable for labeling the molecule.

Polypeptides 

The invention provides members of a cluster of FcR and FcR gene relatives
encoded by genes in the human chromosome 1q21-23 region, or analogous region in
non-human subjects, including, for example, chromosome 3 in mouse. A consensus
amino acid motif, based on the FcγRI, FcγRII, FcγRIII, and pIgR extracellular regions,
was used in a GenBank protein database query to identify members of the gene
subfamily. Genomic clones were identified that were found to contain FcR relatives
and are termed the Fc receptor homolog (FcRH) subfamily: specifically, FcRH1,
FcRH2, FcRH3, and FcRH6. Also found were mouse Fc receptor homologs
designated moFcRH1, 2, and 3.

By "homologous" is meant about 25 percent homology or greater. Homology
is also characterized by proximity in the location of the genes and by similarities as
identified in a composite analysis. As used herein, "percent homology" of two amino
acid sequences or of two nucleic acid sequences is determined using the algorithm of
Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264-2268 (1990)). Such an
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al.
(J. Mol. Biol. 215:403-410 (1990)). BLAST nucleotide searches are performed with
the NBLAST program, score = 100, wordlength = 12, to obtain nucleotide sequences
homologous to a nucleic acid molecule of the invention. BLAST protein searches are
performed with the XBLAST program, score = 50, wordlength = 3, to obtain amino
acid sequences homologous to a reference polypeptide. To obtain gapped alignments
for comparison purposes, Gapped BLAST is utilized as described in Altschul et al. (Nucl.
Acids Res. 25: 3389-3402 (1997)). When utilizing BLAST and Gapped BLAST
programs, the default parameters of the respective programs (e.g., XBLAST and
NBLAST) are used. See http://www.ncbi.nlm.nih.gov.
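
For orientation only, the following Python sketch illustrates how a percentage figure can be read off a pair of sequences that have already been aligned, with gaps written as dashes as in Figure 2B. It does not implement the Karlin and Altschul algorithm or any BLAST program; it computes simple percent identity, a different and cruder measure, and compares it with the "about 25 percent" figure given above. The function names and the choice of denominator (all alignment columns that are not gaps in both sequences) are assumptions made for the example.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns that are gaps in both sequences.
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold=25.0):
    """True when the simple identity figure is at or above the threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

A figure obtained this way depends on how the alignment was produced, so it is not interchangeable with the BLAST-derived percent homology defined in the text.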

By "FcRH" is meant a Type I transmembrane receptor, or an alternatively
spliced form thereof, including, for example, a secreted form or a GPI-anchored form,
with homology to the classical Fc receptor family. In a preferred embodiment, the
FcRH shows homology with the extracellular regions of FcγRI, FcγRII, FcγRIII, or
pIgR. More specifically, the FcRH shows homology with an amino acid sequence
corresponding with the amino terminal sequences of the second Ig domains of the
FcγRs and the third Ig domain of pIgR or FcγRIII. The FcRH can comprise an
extracellular region, a transmembrane region, and a cytoplasmic region. The
extracellular region preferably comprises one or more Ig domains, and more preferably
less than 9, and even more preferably less than 7 or less than 8 Ig domains. Preferably,
the cytoplasmic region comprises more than 107 (including more than 108, 109, 110,
111, 112, 113, 114, 115, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131,
132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, or 145 amino acids).
Alternatively, the cytoplasmic region comprises less than 104 amino acids (including
less than 103, 102, 101, 100, 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 89, 88, 87, 86, 85,
84, 83, 82, 81, 80). The cytoplasmic region preferably comprises one or more
immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs").

The invention provides isolated FcRHs (e.g., huFcRH1, 2, 3, and 6, and
moFcRH1-3, as described in detail below), as well as fragments and isoforms thereof.
The isolated amino acid sequences provided herein optionally are combined with a
human signal sequence (e.g., MLPRLLLLICAPLCEP (SEQ ID NO:29),
MLLWSLLVTFDAVTEQADS (SEQ ID NO:30), MLLWLLLLILTPGREQS (SEQ ID
NO:31), MLLWTAVLLFVPCVG (SEQ ID NO:32)) or a mouse signal sequence (e.g.,
MPLCLLLLVFAPVGVQS (SEQ ID NO:69), MLPWLLLLICALPCEPA (SEQ ID
NO:72), MSGSFSPCWFTQMWLTLLWTPVN (SEQ ID NO:79)).

In one embodiment, the invention provides huFcRH1 and its fragments and
isoforms. Thus, in one embodiment of the isolated FcRH, the extracellular region
comprises less than four Ig domains. Preferably, the cytoplasmic region comprises less
than 104 amino acids and, even more preferably, comprises less than 104 and more than
86 amino acids. In one embodiment, the transmembrane region comprises an acidic
amino acid (e.g., glutamate or aspartate). The isolated FcRH of the invention comprises
a cytoplasmic region having the amino acid sequence of SEQ ID NO:1, in the presence
or absence of conservative amino acid substitutions. Further provided is the isolated
FcRH, wherein the extracellular region comprises the amino acid sequence of SEQ ID
NO:21, in the presence or absence of conservative amino acid substitutions, and in the
presence or absence of a signal sequence. More specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ID NO:2, in the presence or absence of
conservative amino acid substitutions, and in the presence or absence of a signal
sequence. In one embodiment the signal sequence is MLPRLLLLICAPLCEP (SEQ ID
NO:29). In a preferred embodiment, the FcRH of the invention is expressed by myeloid
cells (e.g., granulocytes and monocytes). Additional characteristics of the full length
FcRH1 include a predicted molecular weight of about 46-47 kDaltons; about 425-435
(e.g., 429) amino acids in length with about 35 strongly basic(+) amino acids (K,R),
about 45 strongly acidic(-) amino acids (D,E), about 144 hydrophobic amino acids
(A,I,L,F,W,V), and about 127 polar amino acids (N,C,Q,S,T,Y); a predicted isoelectric
point of about 5-5.5 (e.g., 5.310); and charge of about -9 at pH 7.0.
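
The residue classes used in the preceding characterization (strongly basic K,R; strongly acidic D,E; hydrophobic A,I,L,F,W,V; polar N,C,Q,S,T,Y) lend themselves to a simple tally. The Python sketch below is illustrative only; the function name `composition` and the "crude net charge" (basic count minus acidic count) are assumptions of the example. Because that difference ignores histidine and the terminal groups, it will not necessarily reproduce the charge values at pH 7.0 quoted in the text.

```python
from collections import Counter

# Residue classes exactly as listed in the text (one-letter code).
CLASSES = {
    "basic": "KR",
    "acidic": "DE",
    "hydrophobic": "AILFWV",
    "polar": "NCQSTY",
}


def composition(seq):
    """Count residues per class; residues outside the four classes are not tallied."""
    counts = Counter(seq.upper())
    result = {
        name: sum(counts[aa] for aa in members)
        for name, members in CLASSES.items()
    }
    result["length"] = len(seq)
    # Rough approximation only: ignores His, termini, and pKa effects.
    result["crude_net_charge"] = result["basic"] - result["acidic"]
    return result
```

Applied to a full-length sequence from the sequence listing, the four class counts can be compared directly with the approximate figures recited for each FcRH.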
In another embodiment, the invention provides an isolated FcRH corresponding
to huFcRH2, its fragments, or isoforms. Thus, the invention provides a FcRH wherein
the cytoplasmic region comprises less than 99 amino acids (e.g., 80, 81, 82, 83, 84, 85,
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98) and wherein the receptor further
comprises an extracellular domain with up to four Ig domains and up to five N-linked
glycosylation sites. More specifically, the isolated FcRH has a cytoplasmic region that
comprises the amino acid sequence of SEQ ID NO:3, in the presence or absence of
conservative amino acid substitutions, or an extracellular region comprising SEQ ID
NO:22, in the presence or absence of conservative amino acid substitutions, and in the
presence or absence of a signal sequence. Even more specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ID NO:4, in the presence or absence of
conservative amino acid substitutions, and in the presence or absence of a signal
sequence. In one embodiment, the signal sequence is WSLLVIFDAVTEQADS (SEQ ID
NO:30). Additional characteristics of the full length FcRH2 include a predicted
molecular weight of about 50-60 kDaltons; about 495-515 (e.g., 508) amino acids in
length with about 44 strongly basic(+) amino acids (K,R), about 49 strongly acidic(-)
amino acids (D,E), about 175 hydrophobic amino acids (A,I,L,F,W,V), and about 161
polar amino acids (N,C,Q,S,T,Y); a predicted isoelectric point of about 6-6.5 (e.g.,
6.188); and charge of about -4 at pH 7.0.

In another embodiment, the invention provides huFcRH3, its fragments, and
isoforms. More specifically, the invention provides an isolated FcRH having a
cytoplasmic region that comprises more than 107 amino acids (e.g., 108, 109, 110, 111,
112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128,
129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145,
146, 147, 148, 149, 150 amino acids). Optionally, the isolated FcRH has a cytoplasmic
region comprising one ITAM and one ITIM. More specifically, the cytoplasmic region
comprises the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:23, in the presence
or absence of conservative amino acid substitutions. In one embodiment, the
extracellular domain of the FcRH comprises the amino acid sequence of SEQ ID
NO:24, in the presence or absence of conservative amino acid substitutions, and in the
presence or absence of a signal sequence. Also provided is an isolated FcRH
comprising the amino acid sequence of SEQ ID NO:6 or SEQ ID NO:25, in the
presence or absence of one or more amino acid substitutions, and in the presence or



WO 03/089624 PCT/US03/09600

absence of a signal sequence. In one embodiment the signal sequence comprises
MLLWLLLLELTPGREQS (SEQ ID NO:31). Additional characteristics of the full
length FcRH3 include a predicted molecular weight of about 80-90 kDaltons; about
725-740 (e.g., 734) amino acids in length with about 68 strongly basic(+) amino acids
(K,R), about 75 strongly acidic(-) amino acids (D,E), about 232 hydrophobic amino
acids (A,I,L,F,W,V), and about 224 polar amino acids (N,C,Q,S,T,Y); a predicted
isoelectric point of about 6.5-7.0 (e.g., 6.852); and charge of about -2 at pH 7.0.

The invention further provides an isolated huFcRH6, its fragments, and
isoforms. More specifically, the FcRH comprises a cytoplasmic region having the
amino acid sequence of SEQ ID NO:26, in the presence or absence of one or more
conservative amino acid substitutions. The extracellular domain comprises the amino
acid sequence of SEQ ID NO:27, in the presence or absence of conservative amino acid
substitutions, and in the presence or absence of a signal sequence. Also provided by
the invention is a FcRH having the amino acid sequence of SEQ ID NO:28, in the
presence or absence of conservative amino acid substitutions, and in the presence or
absence of a signal sequence. In one embodiment the signal sequence is
MLLWTAVLLFVPCVG (SEQ ID NO:32).

The invention further provides a polypeptide comprising the amino acid
sequence of SEQ ID NO:1, 21, 2, 3, 22, 4, 5, 23, 24, 6, 25, 26, 27, or 28, in the presence
or absence of conservative amino acid substitutions. The invention also provides a
polypeptide having at least 80, 85, 90, or 95% homology with SEQ ID NOs: 1, 21, 2, 3,
22, 4, 5, 23, 24, 6, 25, 26, 27, or 28.

The invention further provides an isolated moFcRH1 isoform, its fragments, and
isoforms. The moFcRH1 is an isoform of SEQ ID NO:68. More specifically, the
FcRH comprises four Ig domains, optionally having the sequence of SEQ ID NO:70, in
the presence or absence of one or more conservative amino acid substitutions, and in
the presence or absence of a signal sequence (e.g., the sequence of SEQ ID NO:71).

The invention further provides an isolated moFcRH2, its fragments, and
isoforms. The provided isoforms include one isoform with a transmembrane region and
one isoform lacking the transmembrane region. More specifically, the FcRH comprises
a cytoplasmic region having the amino acid sequence of SEQ ID NO:76, in the presence
or absence of one or more conservative amino acid substitutions. The extracellular
domain comprises the amino acid sequence of SEQ ID NO:74, in the presence or
absence of conservative amino acid substitutions, and in the presence or absence of a
signal sequence. Also provided by the invention is a FcRH having the amino acid
sequence of SEQ ID NO:73, which comprises a transmembrane region, or SEQ ID
NO:77, which lacks the transmembrane region. In each case, the FcRH sequence can
include the presence or absence of conservative amino acid substitutions, and the
presence or absence of a signal sequence. In one embodiment the signal sequence is the
sequence of SEQ ID NO:72.

The invention also provides a moFcRH3, its fragments and isoforms. The
cytoplasmic region can comprise the amino acid sequence of SEQ ID NO:81, in the
presence or absence of conservative amino acid substitutions. Optionally, the
extracellular domain comprises the amino acid sequence of SEQ ID NO:80, in the
presence or absence of conservative amino acid substitutions or in the presence or
absence of a signal sequence (e.g., the sequence of SEQ ID NO:79). The full length
sequence optionally has the amino acid sequence of SEQ ID NO:78, in the presence or
absence of conservative amino acid substitutions or in the presence or absence of a
signal sequence (e.g., the sequence of SEQ ID NO:79).

Fragments, variants, or isoforms of the FcRHs of the invention are provided. It
is understood that these terms include functional variants. Fragments can include the
cytoplasmic region, the extracellular region, the transmembrane region or any portion of
at least 10 amino acids or any combination of the regions or portions. The variants are
produced by making amino acid substitutions, deletions, and insertions, as well as
post-translational modifications. Variations in post-translational modifications can include
variations in the type or amount of carbohydrate moieties of the protein core or any
fragment or derivative thereof. Variations in amino acid sequence may arise naturally
as allelic variations (e.g., due to genetic polymorphism) or may be produced by human
intervention (e.g., by mutagenesis of cloned DNA sequences), such as induced point,
deletion, insertion and substitution mutants. These modifications can result in changes
in the amino acid sequence, provide silent mutations, modify a restriction site, or
provide other specific mutations.

Amino acid sequence modifications fall into one or more of three classes:
substitutional, insertional or deletional variants. Insertions include amino and/or
carboxyl terminal fusions as well as intrasequence insertions of single or multiple
amino acid residues. Insertions ordinarily will be smaller insertions than those of
amino or carboxyl terminal fusions, for example, on the order of one to four residues.
Deletions are characterized by the removal of one or more amino acid residues from the
protein sequence. Typically, no more than about 2 to 6 residues are deleted at any one
site within the protein molecule. These variants ordinarily are prepared by site-specific
mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA
encoding the variant, and thereafter expressing the DNA in recombinant cell culture.
Techniques for making substitution mutations at predetermined sites in DNA having a
known sequence are well known and include, for example, M13 primer mutagenesis
and PCR mutagenesis. Amino acid substitutions are typically of single residues but
may include multiple substitutions at different positions; insertions usually will be on
the order of from about 1 to 10 amino acid residues but can be more; and deletions will
range from about 1 to 30 residues, but can be more. Deletions or insertions preferably
are made in adjacent pairs, i.e., a deletion of 2 residues or insertion of 2 residues.
Substitutions, deletions, insertions or any combination thereof may be combined to
arrive at a final construct. The mutations must not place the sequence out of reading
frame and preferably will not create complementary regions that could produce
secondary mRNA structure. Substitutional variants are those in which at least one
residue has been removed and a different residue inserted in its place. Such
substitutions generally are made in accordance with Table 1 and are referred to as
conservative substitutions.



TABLE 1: Amino Acid Substitutions

Original Residue     Exemplary Substitute
Ala                  Ser
Arg                  Lys
Asn                  Gln
Asp                  Glu
Cys                  Ser
Gln                  Asn
Glu                  Asp
Gly                  Pro
His                  Gln
Ile                  Leu; Val
Leu                  Ile; Val
Lys                  Arg; Gln
Met                  Leu; Ile
Phe                  Met; Leu; Tyr
Ser                  Thr
Thr                  Ser
Trp                  Tyr
Tyr                  Trp; Phe
Val                  Ile; Leu
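
By way of illustration only, Table 1 can be held as a simple lookup so that a proposed substitution can be checked against it. The following Python sketch is not part of the disclosed methods; the names CONSERVATIVE_SUBSTITUTIONS and is_conservative are hypothetical and chosen only for this example. Note that the table is directional: it lists His to Gln, for instance, but not Gln to His.

```python
# Table 1 as a lookup: original residue -> exemplary conservative substitutes.
# The dictionary name and helper function are illustrative assumptions.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_conservative(original: str, substitute: str) -> bool:
    """Return True if Table 1 lists `substitute` for `original`."""
    return substitute in CONSERVATIVE_SUBSTITUTIONS.get(original, set())
```

A substitution that this check rejects is not excluded by the description; it is simply less conservative than those in Table 1.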




Substantial changes in function or immunological identity are made by selecting
substitutions that are less conservative than those in Table 1, i.e., selecting residues that
differ more significantly in their effect on maintaining (a) the structure of the
polypeptide backbone in the area of the substitution, for example as a sheet or helical
conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c)
the bulk of the side chain. The substitutions that in general are expected to produce the
greatest changes in the protein properties will be those in which (a) a hydrophilic
residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g.,
leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted
for (or by) any other residue; (c) a residue having an electropositive side chain, e.g.,
lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g.,
glutamyl or aspartyl; (d) a residue having a bulky side chain, e.g., phenylalanine, is
substituted for (or by) one not having a side chain, e.g., glycine; or (e) the number of
sites for sulfation and/or glycosylation is increased.

Substitutional or deletional mutagenesis can be employed to insert sites for
N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine
or other labile residues also may be desirable. Deletions or substitutions of potential
proteolysis sites, e.g., Arg, are accomplished, for example, by deleting one of the basic
residues or substituting one with glutaminyl or histidyl residues.
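
By way of illustration only, the N-glycosylation motif Asn-X-Thr/Ser can be located in a one-letter amino acid sequence with a short pattern search. The following Python sketch is not part of the disclosed methods. The function name is hypothetical, and the exclusion of proline at position X is an assumption taken from the commonly described sequon rule rather than from the text above.

```python
import re
from typing import List

# Asn, then any residue except Pro (assumption), then Ser or Thr.
# The lookahead lets overlapping motifs be reported individually.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_n_glycosylation_sites(protein: str) -> List[int]:
    """Return 1-based positions of the Asn in each Asn-X-Thr/Ser motif."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]
```

Running the search before and after a proposed substitution or deletion shows whether the change creates or removes a candidate site; whether a candidate site is actually glycosylated depends on the host cell.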

Certain post-translational derivatizations are the result of the action of
recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl
residues are frequently post-translationally deamidated to the corresponding glutamyl
and aspartyl residues. Alternatively, these residues are deamidated under mildly acidic
conditions. Other post-translational modifications include hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of
the α-amino groups of lysine, arginine, and histidine side chains (T.E. Creighton,
Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp
79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of
the C-terminal carboxyl. Modifications in the FcRH can also include modifications in
glycosylation.

In all mutational events, it is understood that the controlling aspect of the
mutation is the function that the subsequent protein possesses. The preferred mutations
are those that do not detectably change the desired function or that increase the desired
function.



Nucleic Acids 

Also provided is an isolated nucleic acid that encodes the FcRH of the
invention. The nucleic acid can be single or double stranded and can be RNA or DNA.
More specifically, the invention provides an isolated nucleic acid, comprising a
nucleotide sequence that encodes SEQ ID NO:1, SEQ ID NO:21, SEQ ID NO:2, SEQ
ID NO:3, SEQ ID NO:22, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:23, SEQ ID
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID
NO:6, SEQ ID NO:70, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:76, SEQ ID
NO:77, SEQ ID NO:78, SEQ ID NO:80, or SEQ ID NO:81, optionally with conservative
amino acid substitutions. Optionally the nucleic acid further encodes a signal sequence
(e.g., the signal sequences of SEQ ID NO:29, 30, 31, 32, 71, 75, 79). The isolated
nucleic acid optionally encodes the sequences with 80, 85, 90, or 95% identity. More
specifically, the invention provides an isolated nucleic acid, comprising a nucleotide
sequence of SEQ ID NO:7, SEQ ID NO:13, SEQ ID NO:8, SEQ ID NO:34, SEQ ID
NO:9, SEQ ID NO:14, SEQ ID NO:10, SEQ ID NO:36, SEQ ID NO:11, SEQ ID
NO:15, SEQ ID NO:16, SEQ ID NO:12, SEQ ID NO:38, SEQ ID NO:17, SEQ ID
NO:18, SEQ ID NO:19, SEQ ID NO:20; SEQ ID NO:40, SEQ ID NO:84, SEQ ID
NO:85, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID
NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID
NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID
NO:101, or SEQ ID NO:102. Optionally, the isolated nucleic acid can further include
bases that encode a signal sequence and thus the nucleotide sequence encoding the
extracellular region or full-length huFcRH1, 2, 3, or 6 can optionally further comprise
the nucleotide sequence of SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, or SEQ ID
NO:39. Optionally, the isolated nucleic acids for moFcRHs include nucleic acid
sequences that encode signal sequences as well, including for example, those portions
of nucleic acid sequences SEQ ID NO:101, SEQ ID NO:97, SEQ ID NO:94, SEQ ID
NO:91, SEQ ID NO:88, SEQ ID NO:84.

Preferably the nucleic acid that encodes the full length FcRH1 includes about
1290 bases. The nucleic acid that encodes the full length FcRH2 includes about 1527
bases, and the nucleic acid that encodes the full length FcRH3 includes about 2205
bases.

The invention also provides an isolated nucleic acid comprising a sequence that
hybridizes under stringent conditions to a hybridization probe, wherein the
hybridization probe comprises the nucleotide sequence of SEQ ID NO:7, SEQ ID
NO:13, SEQ ID NO:8, SEQ ID NO:34, SEQ ID NO:9, SEQ ID NO:14, SEQ ID NO:10,
SEQ ID NO:36, SEQ ID NO:11, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:12, SEQ
ID NO:38, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, and SEQ ID NO:20; SEQ
ID NO:40, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:87, SEQ ID NO:88, SEQ ID
NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID
NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID
NO:99, SEQ ID NO:100, SEQ ID NO:101, or SEQ ID NO:102, or the complement of
either sequence.

Further provided is a single stranded nucleic acid that hybridizes under stringent
conditions to a nucleic acid having the sequence of SEQ ID NO:7, SEQ ID NO:13,
SEQ ID NO:8, SEQ ID NO:34, SEQ ID NO:9, SEQ ID NO:14, SEQ ID NO:10, SEQ
ID NO:36, SEQ ID NO:11, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:12, SEQ ID
NO:38, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, and SEQ ID NO:20; SEQ ID
NO:40, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:87, SEQ ID NO:88, SEQ ID
NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID
NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID
NO:99, SEQ ID NO:100, SEQ ID NO:101, or SEQ ID NO:102.
By "hybridizing under stringent conditions" or "hybridizing under highly
stringent conditions" is meant that the hybridizing portion of the hybridizing nucleic
acid, typically comprising at least 15 nucleotides (e.g., 20, 25, 30, or 50 nucleotides),
hybridizes to all or a portion of the provided nucleotide sequence under stringent
conditions. The term "hybridization" typically means a sequence driven interaction
between at least two nucleic acid molecules, such as a primer or a probe and a gene.
Sequence driven interaction means an interaction that occurs between two nucleotides
or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For
example, G interacting with C or A interacting with T are sequence driven interactions.
Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen
face of the nucleotide. The hybridization of two nucleic acids is affected by a number of
conditions and parameters known to those of skill in the art. For example, the salt
concentrations, pH, and temperature of the reaction all affect whether two nucleic acid
molecules will hybridize. Generally, the hybridizing portion of the hybridizing nucleic
acid is at least 80%, for example, at least 90%, 95%, or 98%, identical to the sequence
of or a portion of a nucleic acid encoding an FcRH of the invention, or its complement.
Hybridizing nucleic acids of the invention can be used, for example, as a cloning
probe, a primer (e.g., for PCR), a diagnostic probe, or an antisense probe.
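
By way of illustration only, the percent identity between a hybridizing portion and the corresponding portion of a target sequence can be computed position by position. The following Python sketch is not part of the disclosed methods. It assumes the two sequences have already been aligned without gaps and have equal length; the function names and the default 80% threshold argument are hypothetical choices for this example.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of matching positions between two pre-aligned, ungapped sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_identity(seq_a: str, seq_b: str, threshold_pct: float = 80.0) -> bool:
    """True if the two sequences are at least `threshold_pct` percent identical."""
    return percent_identity(seq_a, seq_b) >= threshold_pct
```

For sequences of unequal length or with insertions and deletions, an alignment step would be needed first; that step is outside this sketch.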

Hybridization of the oligonucleotide probe to a nucleic acid sample typically is
performed under stringent conditions. Nucleic acid duplex or hybrid stability is
expressed as the melting temperature or Tm, which is the temperature at which a probe
dissociates from a target DNA. This melting temperature is used to define the required
stringency conditions. If sequences are to be identified that are related and substantially
identical to the probe, rather than identical, then it is useful to first establish the lowest
temperature at which only homologous hybridization occurs with a particular
concentration of salt (e.g., SSC or SSPE). Assuming that a 1% mismatch results in a
1 °C decrease in the Tm, the temperature of the final wash in the hybridization reaction
is reduced accordingly (for example, if sequences having >95% identity with the probe
are sought, the final wash temperature is decreased by 5 °C). In practice, the change in
Tm can be between 0.5 °C and 1.5 °C per 1% mismatch. Stringent conditions involve
hybridizing at 68 °C in 5x SSC/5x Denhardt's solution/1.0% SDS, and washing in 0.2x
SSC/0.1% SDS at room temperature. Moderately stringent conditions include washing
in 3x SSC at 42 °C. The parameters of salt concentration and temperature can be varied
to achieve the optimal level of identity between the probe and the target nucleic acid.
Additional guidance regarding such conditions is readily available in the art, for
example, in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Cold
Spring Harbor Press, NY; and Ausubel et al. (eds.), 1995, Current Protocols in
Molecular Biology, (John Wiley & Sons, NY) at Unit 2.10.
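
By way of illustration only, the wash temperature adjustment described above reduces to simple arithmetic. The following Python sketch is not part of the disclosed methods. The function name and the 68 °C starting temperature used in the example are hypothetical; the per-mismatch coefficient defaults to the 1 °C assumption stated above and can be varied over the 0.5 °C to 1.5 °C range.

```python
def adjusted_wash_temp(homologous_temp_c: float,
                       min_identity_pct: float,
                       degrees_per_pct_mismatch: float = 1.0) -> float:
    """Lower the final wash temperature to tolerate the allowed mismatch.

    homologous_temp_c: lowest temperature at which only homologous
        hybridization occurs at the chosen salt concentration.
    min_identity_pct: minimum percent identity sought with the probe.
    degrees_per_pct_mismatch: Tm decrease per 1% mismatch (assumed 1 degree C).
    """
    if not 0.0 <= min_identity_pct <= 100.0:
        raise ValueError("min_identity_pct must be between 0 and 100")
    allowed_mismatch_pct = 100.0 - min_identity_pct
    return homologous_temp_c - allowed_mismatch_pct * degrees_per_pct_mismatch


# Illustrative values only: a 68 degree C baseline and 95% identity sought.
print(adjusted_wash_temp(68.0, 95.0))        # 63.0
print(adjusted_wash_temp(68.0, 95.0, 0.5))   # 65.5
print(adjusted_wash_temp(68.0, 95.0, 1.5))   # 60.5
```

The spread between the last two values shows why the coefficient is best confirmed empirically for a given probe and salt concentration.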

The nucleic acids of the present invention are optionally labeled, directly or
indirectly. Such labeled nucleic acids are useful in various diagnostic techniques
including, for example, in situ hybridization, FISH, in situ PCR, and PRINS. These



WO 03/089624 PCT/US03/09600


methods involve the preparation of short sequences of single-stranded nucleic acid
probes that are complementary to the nucleic acid sequences that encode an FcRH.
See, e.g., M Andreeff and D Pinkel (1999), An Introduction to Fluorescent In-Situ
Hybridization: Principles and Clinical Applications, John Wiley & Sons, Ltd; Roche
Applied Sciences (2000), Nonradioactive In Situ Hybridization Application Manual;
Roche Applied Sciences (1999), PCR Manual, 2d edition, which are incorporated in
their entirety for methods of using nucleic acids.

Vectors, cells, and methods of using 
Also provided is an expression vector comprising a nucleic acid of the
invention, wherein the nucleic acid is operably linked to an expression control
sequence. A wide variety of expression system/regulatory sequence combinations may
be employed in expressing the disclosed nucleic acids. Such useful regulatory sequences
include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or
adenovirus, the lac system, the trp system, the TAC system, the TRC system, the LTR
system, the major operator and promoter regions of phage lambda, the control regions
of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic
enzymes, the promoters of acid phosphatase (for example, Pho5), the AOX 1 promoter
of methylotrophic yeast, the promoters of the yeast α-mating factors, and other sequences
known to control the expression of genes of prokaryotic or eukaryotic cells or their
viruses, and various combinations thereof.

Such an expression vector can be designed to be expressed by eukaryotic cells
or prokaryotic cells. The vectors of the present invention thus provide DNA molecules
which are capable of integration into a prokaryotic or eukaryotic chromosome and
expression. The inserted genes in viral and retroviral vectors usually contain promoters
and/or enhancers to help control the expression of the desired gene product. A
promoter is generally a sequence or sequences of DNA that function when in a
relatively fixed location in regard to the transcription start site. A promoter contains
core elements required for basic interaction of RNA polymerase and transcription
factors, and may contain upstream elements and response elements. It has been shown
that all specific regulatory elements can be cloned and used to construct expression
vectors that are selectively expressed in specific cell types. For example, the glial
fibrillary acidic protein (GFAP) promoter has been used to selectively express genes in
cells of glial origin. Expression vectors used in eukaryotic host cells (e.g., yeast, fungi,
insect, plant, animal, human or nucleated cells) may also contain sequences necessary
for the termination of transcription which may affect mRNA expression. These regions
are transcribed as polyadenylated segments in the untranslated portion of the mRNA
encoding tissue factor protein. The 3' untranslated regions also include transcription
termination sites. It is preferred that the transcription unit also contain a
polyadenylation region. One benefit of this region is that it increases the likelihood that
the transcribed unit will be processed and transported like mRNA. The identification
and use of polyadenylation signals in expression constructs is well established. It is
preferred that homologous polyadenylation signals be used in the transgene constructs.
In certain transcription units, the polyadenylation region is derived from the SV40 early
polyadenylation signal and consists of about 400 bases. It is also preferred that the
transcribed units contain other standard sequences, alone or in combination with the
above sequences, to improve expression from, or stability of, the construct.

The invention further provides transfer vectors, which include any nucleotide
construction used to deliver genes into cells (e.g., a plasmid), or as part of a general
strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et
al., Cancer Res. 53:83-88 (1993)). As used herein, plasmid or viral vectors are agents
that transport the disclosed nucleic acids into the cell without degradation and include a
promoter yielding expression of the gene in the cells into which it is delivered. In some
embodiments the FcRHs are derived from either a virus or a retrovirus. Viral vectors
include, for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia
virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses,
including these viruses with the HIV backbone. Also preferred are any viral families
that share the properties of these viruses that make them suitable for use as vectors.
Retroviruses include Murine Moloney Leukemia virus, MMLV, and retroviruses that
express the desirable properties of MMLV as a vector. Retroviral vectors are able to
carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors,
and for this reason are a commonly used vector. However, they are not as useful in
non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with,
have high titers, can be delivered in aerosol formulation, and can transfect
non-dividing cells. Pox viral vectors are large and have several sites for inserting genes;
they are thermostable and can be stored at room temperature. A preferred embodiment
is a viral vector which has been engineered so as to suppress the immune response of
the host organism, elicited by the viral antigens.

Viral vectors can have higher transduction abilities (ability to introduce genes)
than chemical or physical methods to introduce genes into cells. Typically, viral
vectors contain nonstructural early genes, structural late genes, an RNA polymerase III
transcript, inverted terminal repeats necessary for replication and encapsidation, and
promoters to control the transcription and replication of the viral genome. When
engineered as vectors, viruses typically have one or more of the early genes removed
and a gene or gene/promoter cassette is inserted into the viral genome in place of the
removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign
genetic material. The necessary functions of the removed early genes are typically
supplied by cell lines that have been engineered to express the gene products of the
early genes in trans.

A retrovirus is an animal virus belonging to the virus family of Retroviridae,
including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are
described by Verma, I.M., Retroviral vectors for gene transfer. In Microbiology-1985,
American Society for Microbiology, pp. 229-232, Washington, (1985), which is
incorporated by reference herein. Examples of methods for using retroviral vectors for
gene therapy are described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT
applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-932
(1993)); the teachings of which are incorporated herein by reference. A retrovirus is
essentially a package which has packed into it nucleic acid cargo. The nucleic acid
cargo carries with it a packaging signal, which ensures that the replicated daughter
molecules will be efficiently packaged within the package coat. In addition to the
package signal, there are a number of molecules that are needed in cis for the
replication and packaging of the replicated virus. Typically a retroviral genome
contains the gag, pol, and env genes which are involved in the making of the protein
coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA
that is to be transferred to the target cell. Retrovirus vectors typically contain a
packaging signal for incorporation into the package coat, a sequence which signals the
start of the gag transcription unit, elements necessary for reverse transcription,
including a primer binding site to bind the tRNA primer of reverse transcription,
terminal repeat sequences that guide the switch of RNA strands during DNA synthesis,




a purine rich sequence 5' to the 3' LTR that serves as the priming site for the synthesis of
the second strand of DNA, and specific sequences near the ends of the LTRs
that enable the DNA state of the retrovirus to insert into the host
genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign
sequence to be inserted into the viral genome, become reverse transcribed, and upon
replication be packaged into a new retroviral particle. This amount of nucleic acid is
sufficient for the delivery of one to many genes depending on the size of each
transcript. It is preferable to include either positive or negative selectable markers
along with other genes in the insert.

Since the replication machinery and packaging proteins in most retroviral
vectors have been removed (gag, pol, and env), the vectors are typically generated by
placing them into a packaging cell line. A packaging cell line is a cell line that has been
transfected or transformed with a retrovirus that contains the replication and packaging
machinery, but lacks any packaging signal. When the vector carrying the DNA of
choice is transfected into these cell lines, the vector containing the gene of interest is
replicated and packaged into new retroviral particles, by the machinery provided in cis
by the helper cell. The genomes for the machinery are not packaged because they lack
the necessary signals.

The construction of replication-defective adenoviruses has been described
(Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell. Biol. 6:2872-
2883 (1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J.
Virology 61:1226-1239 (1987); Zhang, Generation and identification of recombinant
adenovirus by liposome-mediated transfection and PCR analysis, BioTechniques
15:868-872 (1993)). The benefit of the use of these viruses as vectors is that they are
limited in the extent to which they can spread to other cell types, since they can
replicate within an initial infected cell, but are unable to form new infectious viral
particles. Recombinant adenoviruses have been shown to achieve high efficiency gene
transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular
endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J. Clin.
Invest. 92:1580-1586 (1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993);
Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159
(1993); La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem.
267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner,
Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 (1993);
Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud,
Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-507
(1993)). Recombinant adenoviruses achieve gene transduction by binding to specific
cell surface receptors, after which the virus is internalized by receptor-mediated
endocytosis, in the same manner as wild type or replication-defective adenovirus
(Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J.
Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985);
Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol. 4:1528-1533
(1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319
(1993)).

A viral vector can be one based on an adenovirus which has had the E1 gene
removed, and these virions are generated in a cell line such as the human 293 cell line.
In another preferred embodiment both the E1 and E3 genes are removed from the
adenovirus genome.

Another type of viral vector is based on an adeno-associated virus (AAV). This
defective parvovirus is a preferred vector because it can infect many cell types and is
nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild
type AAV is known to stably insert into chromosome 19. Vectors which contain this
site specific integration property are preferred. An especially preferred embodiment of
this type of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which
can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker
gene, such as the gene encoding the green fluorescent protein, GFP.

In another type of AAV virus, the AAV contains a pair of inverted terminal 
repeats (ITRs) which flank at least one cassette containing a promoter which directs
cell-specific expression operably linked to a heterologous gene. Heterologous in this 
context refers to any nucleotide sequence or gene which is not native to the AAV or 
B19 parvovirus. 

Typically the AAV and B19 coding regions have been deleted, resulting in a
safe, noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity
and site-specific integration, but not cytotoxicity, and the promoter directs cell-specific
expression. United States Patent No. 6,261,834 is herein incorporated by reference for
material related to the AAV vector.

Molecular genetic experiments with large human herpesviruses have provided a
means whereby large heterologous DNA fragments can be cloned, propagated and
established in cells permissive for infection with herpesviruses (Sun et al., Nature
Genetics 8: 33-41, 1994; Cotter and Robertson, Curr Opin Mol Ther 5: 633-644, 1999).
These large DNA viruses (herpes simplex virus (HSV) and Epstein-Barr virus (EBV))
have the potential to deliver fragments of human heterologous DNA > 150 kb to
specific cells. EBV recombinants can maintain large pieces of DNA in the infected B-
cells as episomal DNA. Individual clones carrying human genomic inserts up to 330 kb
appeared genetically stable. The maintenance of these episomes requires a specific EBV
nuclear protein, EBNA1, constitutively expressed during infection with EBV.
Additionally, these vectors can be used for transfection, where large amounts of protein
can be generated transiently in vitro. Herpesvirus amplicon systems are also being used
to package pieces of DNA > 220 kb and to infect cells that can stably maintain DNA as
episomes. Other useful systems include, for example, replicating and host-restricted
non-replicating vaccinia virus vectors.

The invention also provides an isolated cell comprising a vector of the
invention. The isolated cell can be either a eukaryotic or prokaryotic cell, such as
strains of E. coli, Pseudomonas, Bacillus, Streptomyces; fungi such as yeasts
(Saccharomyces, and methylotrophic yeast such as Pichia, Candida, Hansenula, and
Torulopsis); and animal cells, such as CHO, R1.1, B-W and LM cells, African Green
Monkey kidney cells (for example, COS 1, COS 7, BSC1, BSC40, and BMT10), insect
cells (for example, Sf9), and human cells and plant cells in tissue culture.

Also provided is a method of making a FcRH, or a fragment or variant thereof,
comprising culturing a cell comprising a vector of the invention under conditions
permitting expression of the FcRH. The method comprises culturing a cell comprising
an exogenous nucleic acid that encodes the FcRH, fragment, or variant, wherein the
exogenous nucleic acid is operably linked to an expression control sequence, and
wherein the culture conditions permit expression of the FcRH, fragment, or variant
under the control of the expression control sequence; harvesting the medium from the
cultured cells; and isolating the FcRH, fragment, or variant from the cell or culture
medium. Optionally the exogenous nucleic acid is the nucleotide sequence of SEQ ID
NO:7, SEQ ID NO:13, SEQ ID NO:8, SEQ ID NO:34, SEQ ID NO:9, SEQ ID NO:14,
SEQ ID NO:10, SEQ ID NO:36, SEQ ID NO:11, SEQ ID NO:15, SEQ ID NO:16, SEQ
ID NO:12, SEQ ID NO:38, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, and SEQ
ID NO:20; SEQ ID NO:40, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:87, SEQ ID
NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID
NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID
NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, or SEQ ID NO:102 or a
combination thereof. Optionally, the exogenous nucleic acid further comprises a
nucleotide sequence that encodes a signal sequence. In the recombinant methods, the
cell can be any known host cell, including for example, a prokaryotic or eukaryotic cell.
The nucleic acids that are delivered to cells, generally in a plasmid or other vector,
typically contain expression controlling systems. For example, the inserted genes in
viral and retroviral systems usually contain promoters and/or enhancers to help control
the expression of the desired gene product.

Those skilled in the art of molecular biology will understand that a wide variety 
of expression systems may be used to produce recombinant FcRH polypeptides (as well 
as fragments, fusion proteins, and amino acid sequence variants with therapeutic
activity) for use in the methods of the invention. Thus, FcRH may be produced using
prokaryotic host cells (e.g., Escherichia coli) or eukaryotic host cells (e.g., 
Saccharomyces cerevisiae, insect cells such as SB cells, or mammalian cells such as 
CHO cells, COS-1, NIH 3T3, or HeLa cells). These cells are commercially available
from, for example, the American Type Culture Collection, Rockville, MD (see also F.
Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
NY, 1998). The method of transformation and the choice of expression vector will 
depend on the host system selected. Transformation and transfection methods are 
described, e.g., in Ausubel et al., supra, and expression vectors may be chosen from the 
numerous examples known in the art.

A nucleic acid sequence encoding an FcRH is introduced into a plasmid or other
vector, which is then used to transform living cells. Constructs in which a cDNA
containing the entire FcRH coding sequence, a fragment of the FcRH coding sequence,
amino acid variations of the FcRH coding sequence, or fusion proteins of the
aforementioned, is inserted in the correct orientation into an expression plasmid may be
used for protein expression. In some cases, for example, it may be desirable to express
the FcRH coding sequence under the control of an inducible or tissue-specific promoter.

Eukaryotic expression systems permit appropriate post-translational
modifications to expressed proteins. Thus, eukaryotic, and more preferably mammalian
expression systems, allow glycosylation patterns comparable to naturally expressed
FcRH. Transient transfection of a eukaryotic expression plasmid allows the transient
production of FcRH by a transfected host cell. FcRH may also be produced by a
stably-transfected mammalian cell line. A number of vectors suitable for stable
transfection of mammalian cells are available to the public (e.g., see Pouwels et al.,
Cloning Vectors: A Laboratory Manual, 1985, Supp. 1987), as are methods for
constructing such cell lines (see e.g., F. Ausubel et al., Current Protocols in Molecular
Biology, John Wiley & Sons, New York, NY, 1998). Another preferred eukaryotic
expression system is the baculovirus system using, for example, the vector pBacPAK9,
which is available from Clontech (Palo Alto, CA). If desired, this system may be used in
conjunction with other protein expression techniques, for example, the myc tag approach
described by Evan et al. (Mol. Cell Biol. 5:3610-3616, 1985) or analogous tagging
approaches, e.g., using a hemagglutinin (HA) tag.

Once the recombinant protein is expressed, it can be isolated from the
expressing cells by cell lysis followed by protein purification techniques such as affinity
chromatography. In this example, an antibody that specifically binds to FcRH, which
may be produced by methods that are well-known in the art, can be attached to a
column and used to isolate FcRH. Once isolated, the recombinant protein can, if
desired, be purified further, e.g., by high performance liquid chromatography (HPLC;
e.g., see Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, Work
and Burdon, Eds., Elsevier, 1980).



Antibodies

The invention also provides a purified antibody or immunologic fragment
thereof, wherein the antibody or fragment thereof selectively binds to an FcRH. As
used herein, the term "antibody" encompasses, but is not limited to, whole
immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually
heterotetrameric glycoproteins, composed of two identical light (L) chains and two
identical heavy (H) chains. Typically, each light chain is linked to a heavy chain by one
covalent disulfide bond, while the number of disulfide linkages varies between the
heavy chains of different immunoglobulin isotypes. Each heavy and light chain also
has regularly spaced intrachain disulfide bridges. Each heavy chain has at one end a
variable domain (V(H)) followed by a number of constant domains. Each light chain
has a variable domain at one end (V(L)) and a constant domain at its other end; the
constant domain of the light chain is aligned with the first constant domain of the heavy
chain, and the light chain variable domain is aligned with the variable domain of the
heavy chain. Particular amino acid residues are believed to form an interface between
the light and heavy chain variable domains. The light chains of antibodies from any
vertebrate species can be assigned to one of two clearly distinct types, called kappa (κ)
and lambda (λ), based on the amino acid sequences of their constant domains.
Depending on the amino acid sequence of the constant domain of their heavy chains,
immunoglobulins can be assigned to different classes. There are five major classes of
immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further
divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and
IgA-2. The heavy chain constant domains that correspond to the different classes of
immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively.

The term "variable" is used herein to describe certain portions of the variable
domains that differ in sequence among antibodies and are used in the binding and
specificity of each particular antibody for its particular antigen. However, the
variability is not usually evenly distributed through the variable domains of antibodies.
It is typically concentrated in three segments called complementarity determining
regions (CDRs) or hypervariable regions both in the light chain and the heavy chain
variable domains. The more highly conserved portions of the variable domains are
called the framework (FR). The variable domains of native heavy and light chains each
comprise four FR regions, largely adopting a β-sheet configuration, connected by three
CDRs, which form loops connecting, and in some cases forming part of, the β-sheet
structure. The CDRs in each chain are held together in close proximity by the FR
regions and, with the CDRs from the other chain, contribute to the formation of the
antigen binding site of antibodies (see Kabat E. A. et al., "Sequences of Proteins of
Immunological Interest" National Institutes of Health, Bethesda, Md. (1987)). The
constant domains are not involved directly in binding an antibody to an antigen, but
exhibit various effector functions, such as participation of the antibody in
antibody-dependent cellular toxicity.

The term "antibody or fragments thereof" can also encompass chimeric
antibodies and hybrid antibodies, with dual or multiple antigen or epitope specificities,
and fragments, such as F(ab')2, Fab', Fab and the like, including hybrid fragments.
Thus, fragments of the antibodies that retain the ability to bind their specific antigens
are provided. For example, fragments of antibodies which maintain FcRH binding
activity are included within the meaning of the term "antibody or fragment thereof."
Such antibodies and fragments can be made by techniques known in the art and can be
screened for specificity and activity according to the methods set forth in the Examples
and in general methods for producing antibodies and screening antibodies for
specificity and activity (See Harlow and Lane. Antibodies, A Laboratory Manual. Cold
Spring Harbor Publications, New York, (1988)).

Also included within the meaning of "antibody or fragments thereof" are
conjugates of antibody fragments and antigen binding proteins (single chain antibodies)
as described, for example, in U.S. Pat. No. 4,704,692, the contents of which are hereby
incorporated by reference.

In one embodiment, the antibody is a monoclonal antibody. The term
"monoclonal antibody" as used herein refers to an antibody obtained from a
substantially homogeneous population of antibodies, i.e., the individual antibodies
comprising the population are identical except for possible naturally occurring
mutations that may be present in minor amounts. The monoclonal antibodies herein
specifically include "chimeric" antibodies in which a portion of the heavy and/or light
chain is identical with or homologous to corresponding sequences in antibodies derived
from a particular species or belonging to a particular antibody class or subclass, while
the remainder of the chain(s) is identical with or homologous to corresponding
sequences in antibodies derived from another species or belonging to another antibody
class or subclass, as well as fragments of such antibodies, so long as they exhibit the
desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc. Natl. Acad. Sci.
USA, 81:6851-6855 (1984)).

Monoclonal antibodies of the invention may be prepared using hybridoma
methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975) or
Harlow and Lane, Antibodies, A Laboratory Manual. Cold Spring Harbor Publications,
New York, (1988). In a hybridoma method, a mouse or other appropriate host animal is
typically immunized with an immunizing agent to elicit lymphocytes that produce or are
capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. Preferably, the immunizing
agent comprises an FcRH. Traditionally, the generation of monoclonal antibodies has
depended on the availability of purified protein or peptides for use as the immunogen.
More recently, DNA-based immunizations have shown promise as a way to elicit strong
immune responses and generate monoclonal antibodies. In this approach, DNA encoding
a portion of FcRH, preferably the N- or C-terminal region, is injected into the host
animal according to methods known in the art.

Generally, either peripheral blood lymphocytes ("PBLs") are used in methods of
producing monoclonal antibodies if cells of human origin are desired, or spleen cells or
lymph node cells are used if non-human mammalian sources are desired. The
lymphocytes are then fused with an immortalized cell line using a suitable fusing agent,
such as polyethylene glycol, to form a hybridoma cell (Goding, "Monoclonal
Antibodies: Principles and Practice" Academic Press, (1986) pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, including myeloma
cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell
lines are employed. The hybridoma cells may be cultured in a suitable culture medium
that preferably contains one or more substances that inhibit the growth or survival of
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture
medium for the hybridomas typically will include hypoxanthine, aminopterin, and
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient
cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable
high level expression of antibody by the selected antibody-producing cells, and are
sensitive to a medium such as HAT medium. More preferred immortalized cell lines
are murine myeloma lines, which can be obtained, for instance, from the Salk Institute
Cell Distribution Center, San Diego, Calif., and the American Type Culture Collection,
Rockville, Md. Human myeloma and mouse-human heteromyeloma cell lines also
have been described for the production of human monoclonal antibodies (Kozbor, J.
Immunol., 133:3001 (1984); Brodeur et al., "Monoclonal Antibody Production
Techniques and Applications" Marcel Dekker, Inc., New York, (1987) pp. 51-63).

The culture medium in which the hybridoma cells are cultured can then be
assayed for the presence of monoclonal antibodies directed against an FcRH.
Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma
cells is determined by immunoprecipitation or by an in vitro binding assay, such as
radioimmunoassay (RIA) or enzyme-linked immunosorbent assay (ELISA). Such
techniques and assays are known in the art, and are described further in Harlow and
Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New York,
(1988).

After the desired hybridoma cells are identified, the clones may be subcloned by
limiting dilution or FACS sorting procedures and grown by standard methods. Suitable
culture media for this purpose include, for example, Dulbecco's Modified Eagle's
Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown in
vivo as ascites in a mammal.

The monoclonal antibodies secreted by the subclones may be isolated or 
purified from the culture medium or ascites fluid by conventional immunoglobulin 
purification procedures such as, for example, protein A-Sepharose, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods,
such as those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal
antibodies of the invention can be readily isolated and sequenced using conventional
procedures (e.g., by using oligonucleotide probes that are capable of binding
specifically to genes encoding the heavy and light chains of murine antibodies). The
hybridoma cells of the invention serve as a preferred source of such DNA. Once
isolated, the DNA may be placed into expression vectors, which are then transfected
into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells,
plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin
protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells.
The DNA also may be modified, for example, by substituting the coding sequence for
human heavy and light chain constant domains in place of the homologous murine
sequences (U.S. Pat. No. 4,816,567) or by covalently joining to the immunoglobulin
coding sequence all or part of the coding sequence for a non-immunoglobulin
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the
constant domains of an antibody of the invention, or can be substituted for the variable
domains of one antigen-combining site of an antibody of the invention to create a
chimeric bivalent antibody comprising one antigen-combining site having specificity
for FcRH and another antigen-combining site having specificity for a different antigen.

In vitro methods are also suitable for preparing monovalent antibodies.
Digestion of antibodies to produce fragments thereof, particularly Fab fragments, can
be accomplished using routine techniques known in the art. For instance, digestion can
be performed using papain. Examples of papain digestion are described in WO
94/29348 published Dec. 22, 1994, U.S. Pat. No. 4,342,566, and Harlow and Lane,
Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York,
(1988). Papain digestion of antibodies typically produces two identical antigen binding
fragments, called Fab fragments, each with a single antigen binding site, and a residual
Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has
two antigen combining sites and is still capable of cross-linking antigen.

The Fab fragments produced in the antibody digestion also contain the constant
domains of the light chain and the first constant domain of the heavy chain. Fab'
fragments differ from Fab fragments by the addition of a few residues at the carboxy
terminus of the heavy chain domain including one or more cysteines from the antibody
hinge region. The F(ab')2 fragment is a bivalent fragment comprising two Fab'
fragments linked by a disulfide bridge at the hinge region. Fab'-SH is the designation
herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol
group. Antibody fragments originally were produced as pairs of Fab' fragments which
have hinge cysteines between them. Other chemical couplings of antibody fragments
are also known.

An isolated immunogenically specific epitope or fragment of the antibody is
also provided. A specific immunogenic epitope of the antibody can be isolated from
the whole antibody by chemical or mechanical disruption of the molecule. The purified
fragments thus obtained can be tested to determine their immunogenicity and specificity
by the methods taught herein. Immunoreactive epitopes of the antibody can also be
synthesized directly. An immunoreactive fragment is defined as an amino acid
sequence of at least about two to five consecutive amino acids derived from the
antibody amino acid sequence.

One method of producing proteins comprising the antibodies of the present
invention is to link two or more peptides or polypeptides together by protein chemistry
techniques. For example, peptides or polypeptides can be chemically synthesized using
currently available laboratory equipment using either Fmoc
(9-fluorenylmethyloxycarbonyl) or Boc (tert-butyloxycarbonyl) chemistry (Applied
Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a
peptide or polypeptide corresponding to the antibody of the present invention, for
example, can be synthesized by standard chemical reactions. For example, a peptide or
polypeptide can be synthesized and not cleaved from its synthesis resin, whereas the
other fragment of an antibody can be synthesized and subsequently cleaved from the
resin, thereby exposing a terminal group that is functionally blocked on the other
fragment. By peptide condensation reactions, these two fragments can be covalently
joined via a peptide bond at their carboxyl and amino termini, respectively, to form an
antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H.
Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of
Peptide Synthesis. Springer-Verlag Inc., NY). Alternatively, the peptide or polypeptide
can be independently synthesized in vivo as described above. Once isolated, these
independent peptides or polypeptides may be linked to form an antibody or fragment
thereof via similar peptide condensation reactions.

For example, enzymatic ligation of cloned or synthetic peptide segments can
allow relatively short peptide fragments to be joined to produce larger peptide
fragments, polypeptides or whole protein domains (Abrahmsen L et al., Biochemistry,
30:4151 (1991)). Alternatively, native chemical ligation of synthetic peptides can be
utilized to synthetically construct large peptides or polypeptides from shorter peptide
fragments. This method consists of a two step chemical reaction (Dawson et al.
Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The
first step is the chemoselective reaction of an unprotected synthetic peptide-α-thioester
with another unprotected peptide segment containing an amino-terminal Cys residue to
give a thioester-linked intermediate as the initial covalent product. Without a change in
the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular
reaction to form a native peptide bond at the ligation site. Application of this native
chemical ligation method to the total synthesis of a protein molecule is illustrated by the
preparation of human interleukin 8 (IL-8) (Baggiolini M et al. (1992) FEBS Lett.
307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al.,
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)).

Alternatively, unprotected peptide segments can be chemically linked where the
bond formed between the peptide segments as a result of the chemical ligation is an
unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This
technique has been used to synthesize analogs of protein domains as well as large
amounts of relatively pure proteins with full biological activity (deLisle Milton RC et
al., Techniques in Protein Chemistry IV. Academic Press, New York, pp. 257-267
(1992)).

The invention also provides fragments of antibodies that have bioactivity. The
polypeptide fragments of the present invention can be recombinant proteins obtained by
cloning nucleic acids encoding the polypeptide in an expression system capable of
producing the polypeptide fragments, such as an adenovirus or baculovirus
expression system. For example, one can determine the active domain of an antibody
from a specific hybridoma that can cause a biological effect associated with the
interaction of the antibody with FcRH. For example, amino acids found not to
contribute to either the activity or the binding specificity or affinity of the antibody can
be deleted without a loss in the respective activity.

For example, amino or carboxy-terminal amino acids can be sequentially
removed from either the native or the modified non-immunoglobulin molecule or the
immunoglobulin molecule and the respective activity assayed in one of many available
assays. In another example, a fragment of an antibody can comprise a modified
antibody wherein at least one amino acid has been substituted for the naturally
occurring amino acid at a specific position, and a portion of either amino terminal or
carboxy terminal amino acids, or even an internal region of the antibody, has been
replaced with a polypeptide fragment or other moiety, such as biotin, which can
facilitate purification of the modified antibody. For example, a modified
antibody can be fused to a maltose binding protein, through either peptide chemistry or
cloning the respective nucleic acids encoding the two polypeptide fragments into an
expression vector such that the expression of the coding region results in a hybrid
polypeptide. The hybrid polypeptide can be affinity purified by passing it over an
amylose affinity column, and the modified antibody receptor can then be separated from
the maltose binding region by cleaving the hybrid polypeptide with the specific protease
factor Xa. (See, for example, New England Biolabs Product Catalog, 1996, pg. 164.)
Similar purification procedures are available for isolating hybrid proteins from
eukaryotic cells as well.
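
By way of illustration only, the sequential removal of amino- or carboxy-terminal
amino acids described above can be planned computationally before the constructs are
made. The following minimal sketch enumerates the truncated sequences to be expressed
and assayed. The function name, the minimum-length parameter, and the short example
sequence are hypothetical and are not taken from any sequence disclosed herein.

```python
def terminal_truncations(sequence, min_length):
    """List stepwise N-terminal and C-terminal truncations of a sequence.

    One residue is removed per step until `min_length` residues remain.
    Each returned string is a candidate fragment to express and assay.
    """
    if not 1 <= min_length <= len(sequence):
        raise ValueError("min_length must be between 1 and len(sequence)")
    steps = range(1, len(sequence) - min_length + 1)
    return {
        # Residues removed from the amino terminus (start of the string).
        "n_terminal": [sequence[i:] for i in steps],
        # Residues removed from the carboxy terminus (end of the string).
        "c_terminal": [sequence[:len(sequence) - i] for i in steps],
    }


# Hypothetical five-residue peptide, truncated down to three residues.
fragments = terminal_truncations("ABCDE", min_length=3)
```

Each fragment in the resulting lists would then be tested in the chosen activity or
binding assay to locate the smallest fragment that retains the respective activity.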

The fragments, whether or not attached to other sequences, can also include insertions,
deletions, substitutions, or other selected modifications of particular regions or specific
amino acid residues, provided the activity of the fragment is not significantly altered or
impaired compared to the nonmodified antibody or antibody fragment. These
modifications can provide for some additional property, such as to remove or add
amino acids capable of disulfide bonding, to increase its bio-longevity, to alter its
secretory characteristics, etc. In any case, the fragment must possess a bioactive
property, such as binding activity, regulation of binding at the binding domain, etc.
Functional or active regions of the antibody may be identified by mutagenesis of a
specific region of the protein, followed by expression and testing of the expressed
polypeptide. Such methods are readily apparent to a skilled practitioner in the art and
can include site-specific mutagenesis of the nucleic acid encoding the antigen (Zoller
MJ et al. Nucl. Acids Res. 10:6487-500 (1982)).

As used herein, the phrase "specific binding" or "selective binding" refers to a
binding reaction which is determinative of the presence of the FcRH in a heterogeneous
population of proteins and other biologics. Thus, under designated conditions, the
antibodies or fragments thereof of the present invention bind to a particular FcRH (e.g.,
human FcRH1 or any variant thereof), fragment, or variant thereof and do not bind in a
significant amount to other proteins (e.g., human FcRH2, 3, 4, 5, or 6) present in the
subject. The absence of binding in the present invention is considered to be binding that
is less than 1.5 times background (i.e., the level of non-specific binding or slightly
above non-specific binding levels).
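
By way of illustration only, the 1.5 times background criterion stated above can be
applied to assay readings as in the following minimal sketch. The function names and
the example readings are hypothetical. The only value taken from the text is the
1.5-fold threshold, below which a signal is treated as an absence of binding.

```python
def is_binding(signal, background, fold_threshold=1.5):
    """Return True if the signal is at least fold_threshold times background.

    Signals below 1.5 times background are treated as absence of binding.
    """
    if background <= 0:
        raise ValueError("background must be positive")
    return signal >= fold_threshold * background


def bound_targets(readings, background, fold_threshold=1.5):
    """Return, in sorted order, the targets whose signal counts as binding."""
    return sorted(
        name
        for name, signal in readings.items()
        if is_binding(signal, background, fold_threshold)
    )


# Hypothetical ELISA optical densities for one antibody against a panel.
panel = {"FcRH1": 1.20, "FcRH2": 0.11, "FcRH3": 0.12, "FcRH6": 0.09}
targets = bound_targets(panel, background=0.10)
```

An antibody whose list of bound targets contains a single FcRH would be a candidate
for selective binding under the assay conditions used.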


Selective binding to an antibody under such conditions may require an antibody
that is selected for its specificity for a particular protein, variant, or fragment. In one
embodiment the purified antibody selectively binds to the FcRH comprising a
cytoplasmic region with more than 107 or less than 104 amino acids, a transmembrane
region, and an extracellular region. More specifically, the antibody in alternative
embodiments selectively binds FcRH1 but not FcRH2-6; selectively binds FcRH2 but
not FcRH1 or FcRH3-6; selectively binds FcRH3 but not FcRH1-2 or FcRH4-6; or
selectively binds FcRH6 but not FcRH1-5. Thus, as one embodiment, the antibody
selectively binds a polypeptide comprising the amino acid sequence of SEQ ID NO:1, 21,
or 2, or a subset thereof, but not polypeptides comprising the amino acid sequence of
SEQ ID NO:3, 22, 4, 5, 23, 24, 6, 25, 26, 27, 28, or a subset thereof. In another
embodiment the purified antibody binds to the FcRH comprising the amino acid sequence
of SEQ ID NO:3, 22, or 4, but not to the FcRH comprising the amino acid sequence of
SEQ ID NO:1, 21, 2, 5, 23, 24, 6, 25, 26, 27, or 28. In yet another embodiment, the
purified antibody binds to the FcRH comprising the amino acid sequence of SEQ ID NO:5,
23, 24, or 6, but not to the FcRH comprising the amino acid sequence of SEQ ID NO:1,
21, 2, 3, 22, 4, 26, 27, or 28. Similarly, the antibodies of the present invention may
bind only moFcRH1, but not moFcRH2 or moFcRH3; may bind only FcRH2 and not FcRH1 or
FcRH3; and may bind only FcRH3 and not FcRH1 or FcRH2.

In certain embodiments, the antibody binds the extracellular region of one or
more FcRHs and in other embodiments the antibody binds the cytoplasmic region of
one or more FcRHs. In other embodiments the antibody may selectively bind one
isoform of a FcRH. For example, the antibody may bind a polypeptide having the
amino acid sequence of SEQ ID NO:23 but not that of SEQ ID NO:24, or vice versa.
Furthermore, the antibody can bind to a moFcRH1 having the amino acid sequence of
SEQ ID NO:70, but not to a moFcRH1 having the amino acid sequence of SEQ ID NO:68.
The antibody may selectively bind a moFcRH2 with a transmembrane region (e.g.,
having the amino acid sequence of SEQ ID NO:73), but not bind to a moFcRH2 lacking a
transmembrane region (e.g., having the amino acid sequence of SEQ ID NO:77). Optionally
the antibody of the invention can selectively bind moFcRH but not human FcRH, or vice
versa.

A variety of immunoassay formats may be used to select antibodies that
selectively bind with a particular protein, variant, or fragment. For example,
solid-phase ELISA immunoassays are routinely used to select antibodies selectively
immunoreactive with a protein, variant, or fragment thereof. See Harlow and Lane.
Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York,
(1988), for a description of immunoassay formats and conditions that could be used to
determine selective binding. The binding affinity of a monoclonal antibody can, for
example, be determined by the Scatchard analysis of Munson et al., Anal. Biochem.,
107:220 (1980).
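
By way of illustration only, a Scatchard analysis for a single class of independent
binding sites can be sketched as follows. The sketch uses the simple linear Scatchard
transform, bound/free = (Bmax - bound)/Kd, fitted by ordinary least squares, and is
not necessarily the exact computational procedure of the cited reference. The
function name and the data points are hypothetical, and the data are generated from
an assumed Kd of 2.0 and Bmax of 10.0 in arbitrary concentration units.

```python
def scatchard_fit(bound, free):
    """Estimate (Kd, Bmax) from paired bound and free ligand concentrations.

    Plots bound/free against bound and fits a straight line:
    slope = -1/Kd and y-intercept = Bmax/Kd.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


# Hypothetical noise-free data from a one-site model (Kd = 2.0, Bmax = 10.0).
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd_est, bmax_est = scatchard_fit(bound_conc, free_conc)
```

With real, noisy measurements the linear transform distorts the error structure, so
a weighted or nonlinear fit is generally preferable for the final affinity estimate.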

The invention also provides an antibody reagent kit comprising the antibody or
fragment thereof of the invention and reagents for detecting binding of the antibody or
fragment thereof to a ligand. The kit can further comprise containers containing the
antibody or fragment thereof of the invention and containers containing the reagents.
Preferably the ligand is a FcRH, variant, or fragment thereof. Particularly, the kit can
detect the presence of one or more FcRHs specifically reactive with the antibody or an
immunoreactive fragment thereof. The kit can include an antibody bound to a
substrate, a secondary antibody reactive with the antigen and a reagent for detecting a
reaction of the secondary antibody with the antigen. Such a kit can be an ELISA kit and
can comprise the substrate, primary and secondary antibodies when appropriate, and
any other necessary reagents such as detectable moieties, enzyme substrates and color
reagents as described above. The diagnostic kit can, alternatively, be an immunoblot
kit generally comprising the components and reagents described herein. Alternatively,
the kit could be a radioimmunoassay kit, a Western blot assay kit, an
immunohistological assay kit, an immunocytochemical assay kit, a dot blot assay kit, a
fluorescence polarization assay kit, a scintillation proximity assay kit, a homogeneous
time resolved fluorescence assay kit, or a BIAcore analysis kit.

As used throughout, methods of detecting an FcRH or antigen/antibody
complexes, including complexes comprising an FcRH and optionally the antibody of
the present invention, can comprise an ELISA (competition or sandwich), a
radioimmunoassay, a Western blot assay, an immunohistological assay, an
immunocytochemical assay, a dot blot assay, a fluorescence polarization assay (Jolley
(1981); Jiskoot et al. (1991); Seethala et al. (1998); Bicamumpaka et al. (1998)), a
scintillation proximity assay (Amersham Life Science (1995) Proximity News. Issue 17;
Amersham Life Science (1995) Proximity News. Issue 18; Park et al. (1999)), a
homogeneous time-resolved fluorescence assay (Park et al. (1999); Stenroos et al.
(1988); Morrison (1988)), or a BIAcore analysis (Fagerstam et al. (1992)
Chromatography 597:397-410). Preferably, the antigen/antibody complex is detectably
tagged either directly or indirectly. Any desired tag can be utilized, such as a
fluorescent tag, a radiolabel, a magnetic tag, or an enzymatic reaction product.

Optionally, the antibody or fragment is a humanized antibody or a fully human
antibody. For example, the antibodies can also be generated in other species and
"humanized" for administration to humans. Alternatively, fully human antibodies can
also be made by immunizing mice or another species capable of making a fully human
antibody (e.g., mice genetically modified to produce human antibodies) and screening
for clones that bind FcRH. See, e.g., Lonberg and Huszar (1995) Human antibodies from
transgenic mice, Int. Rev. Immunol. 13:65-93, which is incorporated herein by
reference in its entirety for methods of producing fully human antibodies. As used
herein, the terms "humanized" and "fully human" in relation to antibodies relate to any
antibody which is expected to elicit a therapeutically tolerable weak immunogenic
response in a human subject.

Humanized forms of non-human (e.g., murine) antibodies are chimeric
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab',
F(ab')2, or other antigen-binding subsequences of antibodies) which contain minimal
sequence derived from non-human immunoglobulin. Humanized antibodies include
human immunoglobulins (recipient antibody) in which residues from a complementarity
determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat or rabbit having the desired
specificity, affinity and capacity. In some instances, Fv framework residues of the
human immunoglobulin are replaced by corresponding non-human residues.
Humanized antibodies may also comprise residues that are found neither in the
recipient antibody nor in the imported CDR or framework sequences. In general, the
humanized antibody will comprise substantially all of at least one, and typically two,
variable domains, in which all or substantially all of the CDR regions correspond to
those of a non-human immunoglobulin and all or substantially all of the FR regions are
those of a human immunoglobulin consensus sequence. The humanized antibody
optimally also will comprise at least a portion of an immunoglobulin constant region
(Fc), typically that of a human immunoglobulin (Jones et al., Nature, 321:522-525
(1986); Riechmann et al., Nature, 332:323-327 (1988); and Presta, Curr. Op. Struct.
Biol., 2:593-596 (1992)).

Methods for humanizing non-human antibodies are well known in the art.
Generally, a humanized antibody has one or more amino acid residues introduced into it
from a source that is non-human. These non-human amino acid residues are often
referred to as "import" residues, which are typically taken from an "import" variable
domain. Humanization can be essentially performed following the method of Winter
and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature,
332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody.
Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat. No.
4,816,567), wherein substantially less than an intact human variable domain has been
substituted by the corresponding sequence from a non-human species. In practice,
humanized antibodies are typically human antibodies in which some CDR residues and
possibly some FR residues are substituted by residues from analogous sites in rodent
antibodies.

The choice of human variable domains, both light and heavy, to be used in 
making the humanized antibodies is very important in order to reduce antigenicity. 
According to the "best-fit" method, the sequence of the variable domain of a rodent 
antibody is screened against the entire library of known human variable domain
sequences. The human sequence which is closest to that of the rodent is then accepted
as the human framework (FR) for the humanized antibody (Sims et al., J. Immunol.,
151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another method
uses a particular framework derived from the consensus sequence of all human
antibodies of a particular subgroup of light or heavy chains. The same framework may
be used for several different humanized antibodies (Carter et al., Proc. Natl. Acad. Sci.
USA, 89:4285 (1992); Presta et al., J. Immunol., 151:2623 (1993)).
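
The "best-fit" selection described above can be illustrated with a short Python sketch. It is only a minimal illustration under stated assumptions: the sequences are short, made-up placeholders rather than real variable domains, the function names are hypothetical, and identity is counted position by position on sequences assumed to be pre-aligned and of equal length.

```python
# Minimal sketch of "best-fit" framework selection (illustrative only).
# Assumption: all sequences are pre-aligned and of equal length.

def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


def best_fit_framework(rodent: str, human_library: dict) -> tuple:
    """Return (name, identity) of the human sequence closest to the rodent one."""
    name = max(human_library,
               key=lambda k: percent_identity(rodent, human_library[k]))
    return name, percent_identity(rodent, human_library[name])


# Hypothetical toy data, not real antibody sequences.
rodent_vh = "EVQLQQSGAE"
human_vh_library = {
    "human_A": "EVQLVESGGG",
    "human_B": "QVQLQQSGAE",
    "human_C": "EVQLQQSGPG",
}

best_name, best_identity = best_fit_framework(rodent_vh, human_vh_library)
print(best_name, best_identity)
```

In practice the comparison would be run over full-length variable domain sequences after a proper alignment; the toy library here only shows the ranking step.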

It is further important that antibodies be humanized with retention of high 
affinity for the antigen and other favorable biological properties. To achieve this goal, 
according to a preferred method, humanized antibodies are prepared by a process of
analysis of the parental sequences and various conceptual humanized products using
three-dimensional models of the parental and humanized sequences. Three-dimensional
immunoglobulin models are commonly available and are familiar to those skilled in the
art. Computer programs are available which illustrate and display probable
three-dimensional conformational structures of selected candidate immunoglobulin
sequences. Inspection of these displays permits analysis of the likely role of the
residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis
of residues that influence the ability of the candidate immunoglobulin to bind its
antigen. In this way, FR residues can be selected and combined from the consensus and
import sequence so that the desired antibody characteristic, such as increased affinity
for the target antigen(s), is achieved. In general, the CDR residues are directly and
most substantially involved in influencing antigen binding (see WO 94/04679,
published 3 Mar. 1994).

Transgenic animals (e.g., mice) that are capable, upon immunization, of
producing a full repertoire of human antibodies in the absence of endogenous
immunoglobulin production can be employed. For example, it has been described that
the homozygous deletion of the antibody heavy chain joining region (J(H)) gene in
chimeric and germ-line mutant mice results in complete inhibition of endogenous
antibody production. Transfer of the human germ-line immunoglobulin gene array in
such germ-line mutant mice will result in the production of human antibodies upon
antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90:2551-255
(1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in
Immuno., 7:33 (1993)). Human antibodies can also be produced in phage display
libraries (Hoogenboom et al., J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol.,
222:581 (1991)). The techniques of Cole et al. and Boerner et al. are also available for
the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies
and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 147(1):86-95
(1991)).

In one embodiment, the antibody or fragment thereof is a single chain antibody. 
In another embodiment, the antibody or fragment is labeled. Optionally the antibody or 
fragment is conjugated or fused with a toxin or fragment thereof. Examples of the toxin 
or toxin moiety include diphtheria, ricin, and modifications thereof. 






Diagnosis and Treatment 

The invention provides uses of the reagents described herein in in vitro and in 
vivo methods of diagnosing and treating a malignancy of hematopoietic cell lineage or 
an autoimmune disease in a subject. The reagents of the present invention are also 
useful in screening for disease manifestations. Such screening may be useful even
before the onset of other clinical symptoms and could be used to screen subjects at
risk for disease, so that prophylactic treatment can be started before the manifestation of
other signs or symptoms.

By "malignancy" is meant a tumor or neoplasm whose cells possess one or more
nuclear or cytoplasmic abnormalities, including, for example, high nuclear to
cytoplasmic ratio, prominent nucleolar/nucleoli variations, variations in nuclear size,
abnormal mitotic figures, or multinucleation. "Malignancies of hematopoietic cell
lineage" include, but are not limited to, myelomas, leukemias, lymphomas (Hodgkin's
and non-Hodgkin's forms), T-cell malignancies, B-cell malignancies, and
lymphosarcomas or other malignancies described in the REAL classification system or
the World Health Organization Classification of Hematologic Malignancies. It should
be noted that the absence or presence of specific FcRHs can be diagnostic for a
particular malignancy of hematopoietic cell lineage or can be diagnostic for a particular
form of a malignancy (e.g., a specific form of leukemia).

By "inflammatory and autoimmune diseases" is meant diseases illustratively including
systemic lupus erythematosus, Hashimoto's disease, rheumatoid arthritis, graft-versus-host
disease, Sjogren's syndrome, pernicious anemia, Addison's disease, scleroderma,
Goodpasture's syndrome, Crohn's disease, autoimmune hemolytic anemia, sterility,
myasthenia gravis, multiple sclerosis, Basedow's disease, thrombopenia purpura,
insulin-dependent diabetes mellitus, allergy, asthma, atopic disease, arteriosclerosis,
myocarditis, cardiomyopathy, glomerular nephritis, hypoplastic anemia, rejection after
organ transplantation, and numerous malignancies of lung, prostate, liver, ovary, colon,
cervix, lymphatic and breast tissues.

Specifically, the diagnostic methods comprise the steps of contacting a 
biological sample of the subject with an antibody or nucleic acid of the invention under 
conditions that allow the antibody to bind to cells of hematopoietic cell lineage or allow 
the nucleic acid to hybridize, preferably under stringent conditions, with nucleic acids 
of the biological sample; and detecting the amount or pattern of binding. Changes in
the amount or pattern of binding as compared to binding in a control sample indicate a 
malignancy or an inflammatory or autoimmune disease. 

In various embodiments, the antibody used in the diagnostic method can
selectively bind with an FcRH having the amino acid sequence of SEQ ID NO:1, 21, 2,
3, 22, 4, 5, 24, or 6.

The detecting step of the diagnostic method can be selected from methods 
routine in the art. For example, the detection step can be performed in vivo using a 
noninvasive medical technique such as radiography, fluoroscopy, sonography, imaging 
techniques such as magnetic resonance imaging, and the like. In vitro detection
methods can be used to detect bound antibody or fragment thereof by ELISA, RIA,
immunohistochemistry (IHC), FACS, FISH, or similar assays.

As used throughout, "biological sample" refers to a sample from any organism. 
The sample can be, but is not limited to, peripheral blood, plasma, urine, saliva, gastric
secretion, feces, bone marrow specimens, primary tumors, embedded tissue sections,
frozen tissue sections, cell preparations, cytological preparations, exfoliate samples
(e.g., sputum), fine needle aspirations, amnion cells, fresh tissue, dry tissue, and
cultured cells or tissue. It is further contemplated that the biological sample of this
invention can also be whole cells or cell organelles (e.g., nuclei). The sample can be
unfixed or fixed according to standard protocols widely available in the art and can also
be embedded in a suitable medium for preparation of the sample. For example, the
sample can be embedded in paraffin or other suitable medium (e.g., epoxy or
acrylamide) to facilitate preparation of the biological specimen for the detection
methods of this invention.

The invention also provides a method of treating a malignancy of hematopoietic 
cell lineage or an inflammatory or autoimmune disease in a subject, comprising 
contacting the subject's malignant cells or inflammatory cells with a therapeutically 
effective amount of a reagent (e.g., an antibody or nucleic acid) or a therapeutic 
composition of a reagent of the invention. The contacting step can occur by
administration of the reagent or composition using any number of means available in
the art. Typically, the reagent or composition is administered to the subject
transdermally (e.g., by a transdermal patch or a topically applied cream, ointment, or the
like), orally, subcutaneously, intrapulmonarily, transmucosally, intraperitoneally,
intrauterinely, sublingually, intrathecally, intramuscularly, intraarticularly, etc. using
conventional methods. In addition, the reagent or composition can be administered via
injectable depot routes such as by using 1-, 3-, or 6-month depot injectable or
biodegradable materials and methods.

Regardless of the route of administration, the amount of the reagent
administered or the schedule for administration will vary among individuals based on
age, size, weight, condition to be treated, mode of administration, and the severity of
the condition. One skilled in the art will realize that dosages are best optimized by the
practicing physician and methods for determining dosage are described, for example, in
Remington's Pharmaceutical Science, latest edition. Guidance in selecting appropriate
doses for antibodies is found in the literature on therapeutic uses of antibodies, e.g.,
Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noyes Publications, Park Ridge,
N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis and
Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A typical dose of
the antibody used alone might range from about 1 μg/kg to up to 100 mg/kg of body weight
or more per day, and preferably 1 ng/kg to up to 1 mg/kg, depending on the factors
mentioned above. An intravenous injection of the antibody or fragment thereof, for
example, could be 10 ng-1 g of antibody or fragment thereof, and preferably 10 ng-1 mg,
depending on the factors mentioned above. For local injection, a typical quantity of
antibody ranges from 1 pg to 1 mg. Preferably, the local injection would be at an antibody
concentration of 1-100 μg/ml, and preferably 1-20 μg/ml.

The nucleic acids of the invention can be delivered to cells in a variety of ways.
For example, if the nucleic acid of this invention is delivered to the cells of a subject in
an adenovirus vector, the dosage for administration of adenovirus to humans can range
from about 10^7 to 10^9 plaque forming units (pfu) per injection, but can be as high as
10^12 pfu per injection. Ideally, a subject will receive a single injection. If additional
injections are necessary, they can be repeated at six month intervals for an indefinite
period and/or until the efficacy of the treatment has been established. As set forth
herein, the efficacy of treatment can be determined by evaluating the clinical
parameters.

The exact amount of the nucleic acid or vector required will vary as described 
above. Thus, it is not possible to specify an exact amount for every nucleic acid or 
vector. An appropriate amount can be determined by one of ordinary skill in the art
using only routine experimentation given the teachings herein.

The invention further provides a therapeutic composition of the reagent of the
invention. Such a composition typically contains from about 0.1 to 90% by weight
(such as 1 to 20% or 1 to 10%) of a therapeutic agent of the invention in a
pharmaceutically acceptable carrier. Solid formulations of the compositions for oral
administration may contain suitable carriers or excipients, such as corn starch, gelatin,
lactose, acacia, sucrose, microcrystalline cellulose, kaolin, mannitol, dicalcium
phosphate, calcium carbonate, sodium chloride, or alginic acid. Disintegrators that can
be used include, without limitation, microcrystalline cellulose, corn starch, sodium
starch glycolate, and alginic acid. Tablet binders that may be used include acacia,
methylcellulose, sodium carboxymethylcellulose, polyvinylpyrrolidone (Povidone),
hydroxypropyl methylcellulose, sucrose, starch, and ethylcellulose. Lubricants that may
be used include magnesium stearates, stearic acid, silicone fluid, talc, waxes, oils, and
colloidal silica.

Liquid formulations for oral administration prepared in water or other aqueous
vehicles may contain various suspending agents such as methylcellulose, alginates,
tragacanth, pectin, kelgin, carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl
alcohol. The liquid formulations may also include solutions, emulsions, syrups and
elixirs containing, together with the active compound(s), wetting agents, sweeteners,
and coloring and flavoring agents. Various liquid and powder formulations can be
prepared by conventional methods for inhalation into the lungs of the mammal to be
treated.

Injectable formulations of the compositions may contain various carriers such as
vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate,
isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene
glycol, and the like). For intravenous injections, water-soluble versions of the
compounds may be administered by the drip method, whereby a pharmaceutical
formulation containing the antifungal agent and a physiologically acceptable excipient
is infused. Physiologically acceptable excipients may include, for example, 5%
dextrose, 0.9% saline, Ringer's solution or other suitable excipients. Intramuscular
preparations, e.g., a sterile formulation of a suitable soluble salt form of the compounds,
can be dissolved and administered in a pharmaceutical excipient such as
water-for-injection, 0.9% saline, or 5% glucose solution. A suitable insoluble form of the
compound may be prepared and administered as a suspension in an aqueous base or a
pharmaceutically acceptable oil base, such as an ester of a long chain fatty acid (e.g.,
ethyl oleate).

A topical semi-solid ointment formulation typically contains a concentration of
the active ingredient from about 1 to 20%, e.g., 5 to 10%, in a carrier such as a
pharmaceutical cream base. Various formulations for topical use include drops,
tinctures, lotions, creams, solutions, and ointments containing the active ingredient and
various supports and vehicles. The optimal percentage of the therapeutic agent in each
pharmaceutical formulation varies according to the formulation itself and the
therapeutic effect desired in the specific pathologies and correlated therapeutic
regimens.

The effectiveness of the method of treatment can be assessed by monitoring the
patient for known signs or symptoms of the conditions being treated. For example, in
the treatment of a malignancy of hematopoietic cell lineage, the reduction or
stabilization of the number of abnormally proliferative cells would indicate successful
treatment. In the treatment of arthritis, for example, a reduction in the amount of joint
inflammation would indicate successful treatment. Thus, by "therapeutically effective"
is meant an amount that provides the desired treatment effect.

The invention further provides a method of modulating a humoral immune
response in a subject, comprising administering to the subject an isolated FcRH, an
antibody, or nucleic acid of the invention. By "modulation" is meant either
up-regulating or down-regulating. Thus, in the case of an allergic response, one skilled in
the art would choose to down-regulate the humoral immune response. In the case of
exposure of a subject to an infectious agent (e.g., a viral or bacterial agent), one skilled
in the art would choose to up-regulate the humoral antibody response.



Experimental 

The following examples are put forth so as to provide those of ordinary skill in
the art with a complete disclosure and description of how the compounds,
compositions, articles, devices and/or methods claimed herein are made and evaluated,
and are intended to be purely exemplary of the invention and are not intended to limit
the scope of what the inventors regard as their invention. Efforts have been made to
ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some
errors and deviations should be accounted for. Unless indicated otherwise, parts are
parts by weight, temperature is in °C or is at ambient temperature, and pressure is at or
near atmospheric.



EXAMPLE 1 

Identification of FcRH1, FcRH2, and FcRH3

To isolate FcRH cDNA clones, rapid amplification of cDNA ends
(RACE)-PCR was performed by using a Marathon-Ready human lymph node cDNA
library (CLONTECH). Gene-specific primers were as follows: FcRH3, forward
5'-TGAGTCTCAGGGTCACAGTTCCG-3' (SEQ ID NO:41) and reverse
5'-GCTCTTGAACTTGGATATTTAGGGGT-3' (SEQ ID NO:42); FcRH2, forward
5'-CCAGTGTATGTCAATGTGGGCTCTG-3' (SEQ ID NO:43) and reverse
5'-CGTTGAAAGAGCTCTTGGACTTTTATC-3' (SEQ ID NO:44); and FcRH1, forward
5'-GCCTCAAAAGAAAAATAGGAAGACGTT-3' (SEQ ID NO:45) and reverse
5'-AAGCTCACATCAGCGACAGGGAC-3' (SEQ ID NO:46). RACE products were
subjected to a second round of nested PCR and visualized by agarose gel
electrophoresis and ethidium bromide staining.

Primers used in end-to-end amplification to generate full-length cDNAs were as
follows: FcRH3, forward 5'-TCTTGGAGATAAGTCGGGCTTT-3' (SEQ ID NO:47)
and reverse 5'-ATCCTGCAGCCCAGCCTCGTAGGAG-3' (SEQ ID NO:48); FcRH2,
forward 5'-GGTCCTCATGCTGCTGTGGTCATT-3' (SEQ ID NO:49) and reverse
5'-GCTGTTGATCTTCCCTTCTGATTC-3' (SEQ ID NO:50); and FcRH1, forward
5'-ATGCTGCCGAGGCTGTTGCTGTTG-3' (SEQ ID NO:51) and reverse
5'-CATAGCATCTTCATAGTCCACATC-3' (SEQ ID NO:52). Each amplification
reaction underwent initial denaturation at 94°C for 30 s followed by 30 cycles of
denaturation at 94°C for 5 s and annealing at 68°C for 4 min, and final extension at
72°C for 6 min.
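
As a quick sanity check on primer sequences such as those listed above, length, GC fraction, and reverse complement can be computed in a few lines. The sketch below is illustrative only: the helper names are hypothetical, and this check is not part of the protocol described in the text.

```python
# Illustrative primer checks; helper names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# FcRH3 end-to-end primers as given in the text.
fcrh3_forward = "TCTTGGAGATAAGTCGGGCTTT"     # SEQ ID NO:47
fcrh3_reverse = "ATCCTGCAGCCCAGCCTCGTAGGAG"  # SEQ ID NO:48

for name, primer in [("FcRH3 forward", fcrh3_forward),
                     ("FcRH3 reverse", fcrh3_reverse)]:
    print(name, len(primer), round(gc_fraction(primer), 2),
          reverse_complement(primer))
```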

PCR products were ligated into the pCR2.1 TOPO T/A vector (Invitrogen).
Inserts were DNA-sequenced on both strands by the dideoxy chain termination method
using Thermo Sequenase (Amersham Pharmacia) and an automated sequencer (Li-Cor,
Lincoln, NE). Nucleotide and amino acid sequence alignment was analyzed with a
DNASTAR (Madison, WI) software package, and homology searches were performed
by using BLAST (Altschul, S. F. et al. (1990) J. Mol. Biol. 215, 403-410).

RNA blot analysis was subsequently performed. Northern blots (CLONTECH)
were hybridized with 32P-dCTP-labeled probes: a 528-bp EcoRI fragment
corresponding to the 5' untranslated (UT)-EC1 regions of the FcRH3 cDNA, a 200-bp
PCR product corresponding to a portion of the 3' UT region of the FcRH2 cDNA, and a
257-bp PCR product corresponding to a portion of the 3' UT region of the FcRH1
cDNA. Membranes were hybridized for 1 h at 65°C, washed, and exposed to x-ray film
(Kubagawa, H. et al. (1997) Proc. Natl. Acad. Sci. USA 94, 5261-5266).

Reverse transcription (RT)-PCR was performed. Human tonsillar cells, obtained
with Institutional Review Board approval, were separated into CD19+ and CD19-
subpopulations by magnetic cell sorting (Miltenyi Biotec, Auburn, CA). Viable CD19+
cells were stained with FITC-labeled anti-CD38 (Immunotech, Westbrook, ME) and
phycoerythrin-labeled anti-IgD mAbs (Southern Biotechnology Associates) before
sorting cells with a FACStarPlus instrument (Becton Dickinson) into Trizol reagent
(Life Technologies, Grand Island, NY) for RNA isolation. Total cellular RNA was
primed with random hexamers and oligo(dT) primers and reverse-transcribed with
Superscript II (Life Technologies) into single-stranded cDNA. RT-PCR was performed
by using RNA from tonsillar B cells and cell lines, with GIBCO/BRL Taq polymerase
(Life Technologies). The following gene-specific primer pairs were used in the RT-PCR
analysis of FcRH1-5 expression in cell lines and tonsillar B cell subpopulations:
FcRH1 forward, 5'-CTC AAC TTC ACA GTG CCT ACT GGG-3' (SEQ ID NO:53)
and reverse, 5'-TCC TGC AGA GTC ACT AAC CTT GAG-3' (SEQ ID NO:54);
FcRH2 forward, 5'-CCA GTG TAT GTC AAT GTG GGC TCT G-3' (SEQ ID NO:55)
and reverse, 5'-CAT TCT TCC CTC AAA TCT TTA CAC-3' (SEQ ID NO:56); FcRH3
forward, 5'-CAG CAC GTG GAT TCG AGT CAC-3' (SEQ ID NO:57) and reverse,
5'-CAG ATC TGG GAA TAA ATC GGG TTG-3' (SEQ ID NO:58); FcRH4 forward,
5'-TCT TCA GAG ATG GCG AGG TCA-3' (SEQ ID NO:59) and reverse, 5'-TTT TGG
GGT GTA CAT CAA CAT ACA AG-3' (SEQ ID NO:60); and FcRH5 forward, 5'-TGT
TGC CCT GTT TCT TCC AAT ACA-3' (SEQ ID NO:61) and reverse, 5'-CAG AGT
TGG CCG ACC TAC GC-3' (SEQ ID NO:62). Each amplification reaction underwent
initial denaturation at 94°C for 5 min followed by 35 cycles of denaturation at 94°C for 30
s, annealing at 60°C for 30 s, extension at 72°C for 1 min, and final extension at 72°C for 7
min. Amplified products were visualized in 1% agarose gels containing ethidium
bromide and documented with the Bio-Rad Fluor-S Imager.
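
The programmed hold time of the cycling conditions given above can be worked out with simple arithmetic. The following sketch is illustrative only; the function name is hypothetical, and instrument ramp times between steps are ignored, so real run times would be longer.

```python
# Programmed hold time of a PCR program, ignoring instrument ramp times.

def program_seconds(initial_s, cycles, step_seconds, final_s):
    """Total hold time: initial step + cycles * (sum of cycle steps) + final step."""
    return initial_s + cycles * sum(step_seconds) + final_s


# RT-PCR program: 94°C 5 min; 35 x (94°C 30 s, 60°C 30 s, 72°C 1 min); 72°C 7 min.
rt_pcr_total = program_seconds(5 * 60, 35, [30, 30, 60], 7 * 60)

# End-to-end program: 94°C 30 s; 30 x (94°C 5 s, 68°C 4 min); 72°C 6 min.
end_to_end_total = program_seconds(30, 30, [5, 4 * 60], 6 * 60)

print(rt_pcr_total / 60, "min")      # 82.0 min
print(end_to_end_total / 60, "min")  # 129.0 min
```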


The following human cell lines were used: REH and Nalm 16 pro-B cell lines
(Korsmeyer, S. J. et al. (1983) J. Clin. Invest. 71, 301-313); 697, 207, and OB5 pre-B
cell lines (Findley, H. W. et al. (1982) Blood 60, 1305-1309; Martin, D. et al. (1991) J.
Exp. Med. 173, 639-645); Ramos, Daudi, and Raji B cell lines (Pulvertaft, R. J. V.
(1964) Lancet 1, 238-240; Klein, E. et al. (1968) Cancer Res. 28, 1300-1310; Klein, G.
et al. (1975) Intervirology 5, 319-334); THP-1 and U937 monocytoid cell lines,
HL-60 promyelocytic and KG-1 myelocytic cell lines, Jurkat T cell line and the K562
erythroid cell line (American Type Culture Collection).

A consensus sequence was generated that corresponds to the GenBank-derived
amino terminal sequences of the second Ig-like domains of
FcR (FcγRI and FcγRII/III) and the third Ig-like domain of the polymeric Ig
receptor: GEPIXLRCHS WKDKXLXKVTYXQNGKAX^ (SEQ ID NO:63). A
search of the National Center for Biotechnology Information protein database with this
sequence identified two overlapping human genomic bacterial artificial chromosome
(BAC) clones, AL135929 and AL356276, which are located at 1q21.2-22. The second
clone contained three putative Ig superfamily genes encoding complementary amino
acid sequences that were designated FcRH1, FcRH2, and FcRH3. See Figure 1. The
predicted amino acid sequences of these gene segments shared 23-57% identity with
each other and 14-28% identity with human FcγRI (CD64). Further analysis of the
FcRH locus led to the identification of two additional genes (FcRH4 and FcRH5) and
one pseudogene (FcRH4ψ), immediately centromeric of FcRH1-3, two of which have
recently been described as IRTA1 (FcRH4) and IRTA2 (FcRH5) (Hatzivassiliou, G. et
al. (2001) Immunity 14, 277-289).

To determine whether these genes are expressed by lymphocytes, the predicted
amino acid sequences of their protein products were used to search the Lymphochip
expressed sequence tag database with the TBLASTN algorithm (Alizadeh, A. A. et al.
(2000) Nature (London) 403, 503-511). Two expressed sequence tags (AA505046 and
AA282433) were identified that share complete identity over 23 amino acids in their
translated ORFs with the N terminus of FcRH1. Lymphochip microarray data analysis
indicated that these expressed sequence tags are expressed at relatively high levels in
peripheral lymphoid tissues, including the lymph nodes, tonsils, resting peripheral B
cells, and normal germinal center (GC) B cells. Among the different lymphoid
malignancies, their expression proved to be highest in chronic lymphocytic leukemias,
follicular lymphomas, and some diffuse large cell lymphomas of B lineage.

FcRH1, FcRH2, and FcRH3 cDNAs were isolated by RACE-PCR from a
human lymph node cDNA library in both 5' and 3' directions. Full-length cDNAs of the
coding regions for FcRH1, FcRH2, and FcRH3 were obtained by end-to-end PCR using
unique primers generated from the cDNA sequences delineated for the 5' UT and 3' UT
regions. Southern blot analysis of human genomic DNA digested with BamHI, EcoRI,
or HindIII using cDNA probes specific for the 3' UT regions of each
cDNA revealed either one or two hybridizing fragments, suggesting that FcRH1,
FcRH2, and FcRH3 are encoded by single genes. Analysis of full-length cDNA
sequences indicated that FcRH1, FcRH2, and FcRH3 have ORFs of 1,287 bp, 1,524 bp,
and 2,202 bp, respectively, and encode type I transmembrane proteins of 429 aa, 508 aa,
and 734 aa, respectively. Based on predicted consensus signal peptide cleavage sites
(Von Heijne, G. (1986) Nucleic Acids Res. 14, 4683-4690; Nielsen, H. (1997) Protein
Eng. 10, 1-6), the relative core peptide molecular masses were estimated as 45,158 for
FcRH1, 53,407 for FcRH2, and 78,849 for FcRH3. These type I transmembrane
proteins possess 3-6 extracellular C2 (Williams, A. F. & Barclay, A. N. (1988) Annu.
Rev. Immunol. 6, 381-405; Bork, P. et al. (1994) J. Mol. Biol. 242, 309-320; Vaughn,
D. E. & Bjorkman, P. J. (1996) Neuron 16, 261-273) type Ig-like domains with 3-7
potential N-linked glycosylation sites, uncharged transmembrane segments, and
relatively long cytoplasmic tails containing consensus motifs for ITIMs and/or ITAMs.
See Fig. 2A.
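
The ORF lengths and protein lengths reported above can be cross-checked with one line of arithmetic per gene, since each amino acid is encoded by one three-nucleotide codon. The sketch below is a simple consistency check using only the numbers stated in the text; the variable names are hypothetical. That each reported ORF length equals exactly three times the protein length suggests the stated lengths do not count the stop codon, which is an inference rather than something the text states.

```python
# Consistency check: ORF length in bp versus encoded protein length in aa.
orf_bp = {"FcRH1": 1287, "FcRH2": 1524, "FcRH3": 2202}
protein_aa = {"FcRH1": 429, "FcRH2": 508, "FcRH3": 734}

codon_counts = {}
for name, bp in orf_bp.items():
    codons, remainder = divmod(bp, 3)  # one codon = three nucleotides
    codon_counts[name] = codons
    status = "consistent" if remainder == 0 and codons == protein_aa[name] else "mismatch"
    print(name, bp, "bp ->", codons, "codons;", protein_aa[name], "aa:", status)
```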

Multiple alignment analysis of the translated cDNAs, using FcRH3 as the index
sequence of comparison, indicates that FcRH1, FcRH2, and FcRH3 have highly
conserved hydrophobic signal peptides and corresponding Ig-like extracellular domains
(Fig. 2B). Their hydrophobic transmembrane domains (uncharged with the exception of
FcRH1, which includes an acidic domain) (Sonnhammer, E. L. L. et al. (1998) in A
Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences,
eds. Glasgow, J., Littlejohn, T., Major, F., Lathrop, R., Sankoff, D. & Sensen, C. (Am.
Assoc. for Artificial Intelligence, Menlo Park, CA), pp. 175-182) are also well
conserved, but their cytoplasmic domains are not. FcRH1 has a long cytoplasmic tail
containing three potential ITAMs, the first and third of which fit the consensus
sequence (E/D)-X-X-Y-X-X-(L/I)-X6-8-Y-X-X-(L/I) (SEQ ID NO:64, with six amino
acid residues between the consensus sequences; SEQ ID NO:65, with seven amino acid
residues between the consensus sequences; and SEQ ID NO:66, with eight amino acid
residues between the consensus sequences), whereas the second has only one tyrosine
residue. The shorter cytoplasmic domain of FcRH2 contains one potential ITAM and two
ITIM consensus sequences (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID NO:67) separated by 22
amino acids. FcRH3 has the longest cytoplasmic tail. It contains one potential ITAM,
one ITIM, and another potential ITAM that also has a single tyrosine residue.
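
Consensus motifs of this kind translate directly into regular expressions, which is one way a cytoplasmic tail could be scanned for candidate ITAMs and ITIMs. The sketch below is a minimal illustration under stated assumptions: the function name is hypothetical, the test sequences are made-up strings rather than FcRH sequences, and the ITIM pattern is written in the commonly cited (I/V/L/S)-X-Y-X-X-(L/V) form, which is an assumption here. Matches are non-overlapping, so closely spaced motifs may be under-counted.

```python
import re

# ITAM consensus from the text: (E/D)-X-X-Y-X-X-(L/I)-X6-8-Y-X-X-(L/I)
ITAM = re.compile(r"[ED]..Y..[LI].{6,8}Y..[LI]")
# ITIM consensus, assumed here as (I/V/L/S)-X-Y-X-X-(L/V)
ITIM = re.compile(r"[IVLS].Y..[LV]")


def find_motifs(sequence: str) -> dict:
    """Return 0-based start positions of non-overlapping ITAM and ITIM matches."""
    sequence = sequence.upper()
    return {
        "ITAM": [m.start() for m in ITAM.finditer(sequence)],
        "ITIM": [m.start() for m in ITIM.finditer(sequence)],
    }


# Hypothetical toy sequences, not real FcRH cytoplasmic tails.
itam_like = "AAEPTYSALAAAAAAAYQPLAA"
itim_like = "GGSIYATLGG"

print(find_motifs(itam_like))
print(find_motifs(itim_like))
```

A motif with only one tyrosine, like the second potential ITAM of FcRH1 described above, would not match the full ITAM pattern and would need a separate, looser pattern.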

An RNA blot analysis with gene-specific probes was performed on 16 human
tissues, including six primary or secondary lymphoid tissues. RNA blots were analyzed
with discriminating α-32P-dCTP-labeled probes generated from the respective FcRH
cDNAs. The following probes were used: (Top) a PCR-generated, 257-bp probe
specific to the 3' UT region of FcRH1; (Middle) a PCR-generated, 290-bp probe
corresponding to the 3' UT region of FcRH2; and (Bottom) a 528-bp EcoRI-digested
fragment of the 5' end of the FcRH3 cDNA corresponding to its 5' UT region, S1, S2,
and EC1 domains. The relative mRNA abundance was indicated by β-actin probe. All
three FcRH gene probes hybridized with transcripts in the secondary lymphoid organs,
spleen and lymph node. An FcRH1-specific probe hybridized with spleen and lymph
node transcripts of about 3.5 kb and about 6.0 kb. Additional hybridization bands of
about 0.7 kb and about 1.5 kb were observed for heart, skeletal muscle, kidney, liver,
and, in less abundance, placental tissue. Larger transcripts also were seen in skeletal
muscle (about 6.0 kb) and in kidney and placenta (about 4.4 kb). An FcRH2-specific
probe hybridized to about 3.0-kb, about 4.4-kb, and about 5.5-kb transcripts most
abundantly in spleen and lymph node. A transcript of approximately 2.4 kb was notable
in the kidney. An FcRH3 probe hybridized with about 3.5-kb, about 5.5-kb, and about
7.0-kb transcripts chiefly in spleen and lymph node. These also were seen, albeit in
lesser abundance, in peripheral blood lymphocytes, thymus, and bone marrow samples.
Additionally, a unique transcript of about 1.35 kb was evident in skeletal muscle. These
results indicated expression of FcRH1, FcRH2, and FcRH3 in peripheral lymphoid
organs, whereas tissue-specific differences in alternative splicing or polyadenylation
were suggested by the differential expression of transcripts with variable size in
nonlymphoid tissues. RT-PCR analysis to date of the non-lymphoid tissue skeletal muscle,
however, does not reveal transcripts despite the Northern analysis results.

When FcRH expression was examined by RT-PCR analysis of cell lines
representing different hematopoietic lineages, FcRH1, FcRH2, and FcRH3 expression
was found in every mature B cell line tested (Table 2). FcRH2 and FcRH3 expression
was limited to the mature B cell lines and not seen in the other types of cells examined.
In contrast, FcRH1 expression was seen in pro-B, T, and myeloid cell lines, although
not in an erythroid cell line.



Table 2. Expression of FcRH transcripts in human B cell lines

Cell Type         Cell line   FcRH1   FcRH2   FcRH3
Pro-B             REH         -       -       -
                  Nalm 16     +       -       -
Pre-B             697         -       -       -
                  207         -       -       -
                  OB5         -       -       -
B                 Ramos       +       +       +
                  Daudi       +       +       +
                  Raji        +       +       +
T                 Jurkat      +       -       -
Monocytic         THP-1       +       -       -
Myelomonocytic    U937        +       -       -
Promyelocytic     HL-60       +       -       -
Myelocytic        KG-1        +       -       -
Erythroid         K562        -       -       -

FcRH1, FcRH2, and FcRH3 expression in cell lines was determined by RT-PCR.



RT-PCR analysis of sorted populations of peripheral blood cells indicated that
FcRH1, FcRH2, FcRH3, and FcRH5 are expressed at relatively high levels in CD19+ B
cells, whereas FcRH4 was expressed at only trace levels. FcRH3 expression was
observed in CD3+ T cells whereas transcripts of FcRH1 were barely detectable. FcRH1
expression also was observed in circulating granulocytes.

To refine the analysis of FcRH expression in secondary lymphoid tissues,
tonsillar lymphocyte subpopulations were isolated. The five discrete subpopulations
of B lineage cells, which can be distinguished by their differential expression of cell
surface IgD and CD38, represent different stages in B cell differentiation: follicular
mantle (IgD+CD38-), pre-GC (IgD+CD38+), GC (IgD-CD38+), memory (IgD-CD38-),
and mature plasma cells (CD38 2+) (Pascual, V. (1994) J. Exp. Med. 180: 329-339).
RT-PCR analysis of FcRH1-5 expression in tonsillar B cell subpopulations was performed.
Viable cells were magnetically sorted into CD19- non-B cells and CD19+ B cells. The
latter were stained with anti-IgD and anti-CD38 mAbs, and the five subpopulations
indicated (CD38- IgD-, CD38- IgD+, CD38+ IgD+, CD38+ IgD-, and CD38 2+) were
sorted by flow cytometry. RT-PCR analysis of FcRH transcripts in non-B cells and the
B cell subpopulations was also performed. After cDNA preparation, PCR amplification
was performed on the equivalent template of approximately 10k cells. Glyceraldehyde-
3-phosphate dehydrogenase (GAPDH) was amplified as a positive control.
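
The IgD/CD38 scheme described above can be written as a small lookup. The following is a minimal sketch only: the function name and the integer encoding of CD38 staining (0 = negative, 1 = positive, 2 = bright, i.e. CD38 2+) are illustrative assumptions and are not part of the sorting protocol itself.

```python
def classify_tonsillar_b_cell(igd_positive: bool, cd38_level: int) -> str:
    """Map IgD / CD38 surface staining to a tonsillar B cell stage.

    cd38_level is a hypothetical encoding chosen for this sketch:
    0 = CD38-, 1 = CD38+, 2 = CD38 2+ (bright).
    """
    if cd38_level not in (0, 1, 2):
        raise ValueError("cd38_level must be 0, 1, or 2")
    if cd38_level == 2:
        # Bright CD38 staining marks mature plasma cells regardless of IgD.
        return "plasma cell"
    if igd_positive:
        return "pre-GC" if cd38_level == 1 else "follicular mantle"
    return "GC" if cd38_level == 1 else "memory"


if __name__ == "__main__":
    for igd, cd38 in [(True, 0), (True, 1), (False, 1), (False, 0), (False, 2)]:
        print(igd, cd38, classify_tonsillar_b_cell(igd, cd38))
```

In practice the positive/negative calls would come from gating thresholds set on the flow cytometer, which this sketch does not model.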

RT-PCR analysis indicated little or no expression of FcRH transcripts in the
non-B lineage CD19- cells, most of which are T cells. However, CD19+ subpopulations
displayed coordinate expression of FcRH1, FcRH2, and FcRH3 transcripts in follicular
mantle, naive, GC, and memory B cell subpopulations, but yielded no evidence of
FcRH transcripts in pre-GC B cells or plasma cells. In contrast, FcRH4 transcripts were
restricted to the follicular mantle and memory B cells, whereas FcRH5 expression
extended to mature plasma cells.

The relationship between the five FcRHs was examined by comparing their full-length,
extracellular, and individual Ig-like domain amino acid sequences. This
analysis, which included a recently identified mouse FcRH ortholog (moFcRH) and
members of the FcR family, used the CLUSTAL method algorithm (Higgins, D. G. &
Sharp, P. M. (1989) Comput. Appl. Biosci. 5, 151-153). Comparison of the full-length
sequences of other FcRH family members with FcRH3 indicated 40-47% identity. By
comparison, the degree of FcRH3 homology with the moFcRH was found to be 35%,
and 21-24% with FcR members residing on chromosome 1: FcγRI, FcγRII, FcγRIII, and
FcεRI. A lower level of amino acid identity (14%) was observed for the chromosome
19 LRC member, FcαR. A slightly higher degree of extracellular homology was
evident. Pairwise analysis of the individual Ig-like subunits indicated conservation in
membrane-distal to membrane-proximal ordering of extracellular domain composition
among family members. Although similar Ig domain subunits were shared among
family members, the individual receptors were found to be composed of unique domain
combinations. The extracellular domain configuration of the moFcRH most closely
resembled that of FcRH2, with which it has 46% identity. The extended pairwise
comparison of the FcRH family with known FcRs suggested the conservation of these
Ig-like domains to some degree throughout the greater family. The resemblance is
particularly evident in the FcRH3 membrane-distal domains that correspond to the three
FcγRI domains and the two domains of FcγRII, FcγRIII, and the FcR γ-chain. This
analysis suggests the ancestral occurrence of differential duplication and diversification
of the individual Ig-like subunits in the respective FcRH family members. The data also
indicate that the FcRHs are more similar to their FcR neighbors on chromosome 1 than
to their FcR relative on chromosome 19.
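
The percent identity figures quoted above are derived from pairwise alignments. As a rough illustration of how such a figure can be computed, the sketch below scores two already-aligned sequences. It rests on assumptions: the sequences are toy strings invented for the example (not FcRH sequences), the function name is hypothetical, and the denominator (aligned columns in which neither sequence has a gap) is one common convention; the convention used in the analysis described here is not stated.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over ungapped columns of a pairwise alignment.

    Both inputs must already be aligned, i.e. of equal length with gap
    characters inserted. Columns containing a gap in either sequence are
    excluded from the denominator (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy alignment: 5 ungapped columns, 4 of them identical.
    print(percent_identity("MKV-LLA", "MRVALL-"))
```

Because different tools count gapped columns differently, identity values are only comparable when computed under the same convention.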

The genomic sequence analysis of relevant chromosome 1q21 BAC
clones indicated that the entire FcRH locus spans 300 kb. The FcRH genes lie in the
same transcriptional orientation toward the centromere. Exon-intron boundaries were
characterized by sequence comparison of their respective cDNA clones and the AG/GT
rule. The FcRH1 gene consists of 11 exons and 10 introns spanning about 28 kb. The
first exon, 5' UT/S1, encodes the 5' UT region, the ATG translation initiation codon,
and the first half of a split signal peptide. S2, the second exon, is
separated from 5' UT/S1 by a long intron of 12.9 kb and, like the neighboring FcRs, is
21 bp in length (van de Winkel, J. G. & Capel, P. J. (1993) Immunol. Today 14, 215-
221; Kulczycki, A., Jr. et al. (1990) Proc. Natl. Acad. Sci. USA 87, 2856-2860; Pang, J.
et al. (1994) J. Immunol. 151, 6166-6174). The extracellular region is encoded by three
closely clustered exons, EC1-EC3, that code for the three Ig-like domains. The
membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic
domain are encoded by a single sixth exon, TM. The cytoplasmic tail is encoded by five
exons, CY1-CY5, and CY5 also encodes the beginning of the 3' UT region.
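
The AG/GT rule used above for exon-intron boundaries reflects the observation that introns typically begin with GT and end with AG. A minimal sketch of such a check follows. The genomic string and exon coordinates are invented toy data, the function name is hypothetical, and the coordinate convention (0-based, half-open intervals on the sense strand) is an assumption made for the example.

```python
from typing import List, Tuple


def introns_follow_gt_ag(genomic: str, exons: List[Tuple[int, int]]) -> List[bool]:
    """Check each intron between consecutive exons for GT...AG boundaries.

    exons holds 0-based, half-open (start, end) intervals sorted by position.
    Returns one boolean per intron, in genomic order.
    """
    sequence = genomic.upper()
    results = []
    for (_, previous_end), (next_start, _) in zip(exons, exons[1:]):
        intron = sequence[previous_end:next_start]
        conforms = (
            len(intron) >= 4
            and intron.startswith("GT")  # donor site at the 5' end of the intron
            and intron.endswith("AG")    # acceptor site at the 3' end of the intron
        )
        results.append(conforms)
    return results


if __name__ == "__main__":
    # Toy sequence: exon, conforming intron, exon, non-conforming gap, exon.
    toy_genomic = "ATGGCC" + "GTAAGTTTCAG" + "GGATCC" + "GCTTTTAAC" + "TTGA"
    toy_exons = [(0, 6), (17, 23), (32, 36)]
    print(introns_follow_gt_ag(toy_genomic, toy_exons))
```

A boundary that fails the check usually means the cDNA-to-genomic alignment should be shifted by a few bases rather than that the intron is unusual.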

FcRH2 contains 12 exons and 11 introns that span 30 kb. It also contains two
exons that encode a split signal peptide, the first of which, 5' UT/S1, includes the 5' UT
region, the ATG translation initiation codon, and first half of the signal peptide. The
second exon, S2, is 21 bp in length. Exons 3-6 encode the four extracellular domains,
EC1-EC4. The seventh exon encodes the membrane-proximal, transmembrane, and the
proximal portion of the cytoplasmic domain. The FcRH2 cytoplasmic tail is encoded by
five exons, CY1-CY5, the last exon of which includes the termination of the ORF and
beginning of the 3' UT region.

The FcRH3 gene consists of 16 exons and 15 introns that span about 24 kb.
Unlike FcRH1 and FcRH2, its 5' UT region is encoded by two exons, 5' UT1 and a
second, 5' UT2/S1, that also encodes the ATG translation initiation codon and the
beginning of the split signal peptide. The third exon, S2, is also 21 bp in length.
Extracellular domains encoded by six exons, EC1-EC6, are followed by exon 10 that
encodes the membrane-proximal, transmembrane, and the proximal portion of the
cytoplasmic domain. The cytoplasmic tail is encoded by five exons, CY1-CY5; the last
contains the beginning of the 3' UT region.
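
The exon layouts described for FcRH1, FcRH2, and FcRH3 can be tallied programmatically. The sketch below is illustrative only: the grouping labels ("EC", "CY") and the data structure are shorthand introduced here, and the counts are limited to the exons named in the text. Under this tally the named FcRH3 exons sum to 15, whereas the text gives 16; the sketch reports the tally and does not resolve that difference.

```python
# Ordered exon groups per gene as (label, number_of_exons), 5' to 3'.
# Labels are shorthand for this sketch, taken from the exon names in the text.
EXON_LAYOUT = {
    "FcRH1": [("5'UT/S1", 1), ("S2", 1), ("EC", 3), ("TM", 1), ("CY", 5)],
    "FcRH2": [("5'UT/S1", 1), ("S2", 1), ("EC", 4), ("TM", 1), ("CY", 5)],
    "FcRH3": [("5'UT1", 1), ("5'UT2/S1", 1), ("S2", 1), ("EC", 6), ("TM", 1), ("CY", 5)],
}


def named_exon_count(gene: str) -> int:
    """Total number of exons named in the layout for the given gene."""
    return sum(count for _, count in EXON_LAYOUT[gene])


def tm_exon_number(gene: str) -> int:
    """1-based position of the TM exon within the gene."""
    position = 0
    for label, count in EXON_LAYOUT[gene]:
        position += count
        if label == "TM":
            return position
    raise KeyError(f"no TM exon recorded for {gene}")


if __name__ == "__main__":
    for gene in EXON_LAYOUT:
        print(gene, named_exon_count(gene), tm_exon_number(gene))
```

The TM exon positions computed this way (sixth, seventh, and tenth) agree with the exon numbers given in the text for the three genes.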

Identification of HuFcRH6
FcRH6 is located in the midst of the classical FcRs at 1q21-23. Its genomic
structure indicates, like the classical FcRs and FcRH1-5, a split hydrophobic signal
peptide encoded by two exons, the second of which is 21 bp.
FcRH6 was characterized using the methods described in Example 1. A
composite analysis of Ig-like domains for relatedness with the other huFcRHs was
performed. See Figure xxx. Sequence analysis of huFcRH6 indicates its type I
transmembrane form contains a consensus motif for a single ITAM, or a single or two
ITIMs.

Initial RT-PCR analysis of huFcRH6 in human tissues and cell lines (as
described in Example 1) reveals transcript expression in normal tonsil and lymph
nodes. In cell lines, expression of huFcRH6 was identified in myeloid cell lines THP-1
(monocytic), U937 (myelomonocytic), and KG-1 (myelocytic). Limited expression, if
any, was identified in the 207 pre-B cell line and the Daudi B cell line.


EXAMPLE 3
Generation of Transfectants and Antibodies
Recombinant constructs for transfection and stable expression of huFcRH1-5
have been generated. The constructs have been ligated into a CMV-driven mammalian
expression vector with and without green fluorescent protein (GFP) fusion at the
carboxyl terminus. Surface expression of huFcRH1 and huFcRH3 was detected for
both GFP and non-GFP forms by staining with antibody supernatant. The antibody
supernatant was derived from hybridomas generated from mice immunized with
recombinant extracellular protein of the respective FcRH. The constructs for huFcRH2,
4, and 5 have been detected by green fluorescence, as well as by surface expression for
FcRH4.

Monoclonal antibodies have been generated, including, for example, an
antibody that binds FcRH1. The preliminary analysis of FACS staining for FcRH1
expression with monoclonal antibody 1-5A3 labeled with a FITC conjugate (mouse
anti-human FcRH1) in peripheral blood from normal volunteers indicates virtually all
CD19+ B cells have huFcRH1 expression, as do CD14+ monocytes and CD13+
granulocytes. CD3+ T cells have limited to no expression of FcRH1. Staining of B-CLL
samples from two different patient peripheral blood samples indicates that virtually all
CD5+/CD19+ B-CLL cells are positive for the FcRH1 1-5A3 antigen. By western blot
analysis of recombinant protein for FcRH1-5 extracellular regions, 1-5A3 appears
specific for FcRH1. 1-5A3 also stains B cell lines Daudi and Raji.



EXAMPLE 4

Identification of MoFcRH1-3
A family of three mouse Fc Receptor Homologs (moFcRHs) was identified
and cloned. Amino acid sequences from the membrane proximal Ig-like domains of
huFcRH1-5 were used to identify putative mouse FcRH orthologs in the NCBI or
Celera genomic, EST, and protein databases using the protein BLAST (BLASTP) and
the translated nucleotide BLAST (TBLASTN) algorithms, respectively. The
location of the moFcR family is split between chromosomes 1 and 3 in regions syntenic
with human chromosome 1q21-23. See Figure 4. The moFcRH are located on mouse
chromosome 3. Approximate positions were determined from GenBank, Celera, and Mouse
Genome Informatics databases. Contigs of ESTs were generated to determine the
putative cDNA sequences.

Genomic organization was determined by comparing cDNA clones generated
from RACE PCR with GenBank and Celera genomic sequences. DNAStar software
was used for analysis of exon-intron boundaries, which were characterized by sequence
comparison and the AG/GT rule. All three genes contain a split signal sequence with a
21 bp S2 exon (exon 2), which is found in all FcR and huFcRH genes on human
chromosome 1.

A comparison of tyrosine based motifs in FcRH cytoplasmic tails indicated
homology with the huFcRH family. See Figure 5. An analysis of sequence homology
conservation is further shown in Figures 6 and 7.

Expression of the moFcRHs in tissue and cell lines was also characterized as
described in Example 1. Briefly, RT-PCR was performed on mouse tissues and cell
lines with gene specific primers. Viable tissue was placed in TRIzol reagent for RNA
extraction. After cDNA preparation, PCR amplification was performed on equivalent
template amounts. Actin was amplified as a positive control. moFcRH3 appears to
have preferential expression in cells of B lineage. The results are shown in Tables 3-4.

Table 3: Tissue Distribution of moFcRH Expression

Table 4. Expression of moFcRH transcripts in cell lines

Cell Type     Cell line   FcRH1   FcRH2   FcRH3
Pro-B         SCID7       +       +/-     +
              Raw8.1      +       +       -
Pre-B         70Z/3       +       +       +
              BC76                +       +
              18-81       +       +
Imm. B        WEHI-231                    +
              WEHI-279            +
B             A20
              X16C8.5             +
T             EL4         +       +/-     -/+
NKT           2C12        +       +/-
Myeloid       WEHI-3      +
Lymphoid      YAC-1               +
Fibroblast    3T3         +       +/-

Expression in cell lines was determined by RT-PCR.

The mouse Fc Receptor Homologs include secreted or type I transmembrane
isoforms that have unique cytoplasmic tails with potential activation and inhibition
motifs. Their chromosomal location, Ig domain homology, and genomic organization
indicate the mouse Fc Receptor Homologs are orthologs of the huFcRH that have
evolved a significant level of diversity. moFcRH1, moFcRH2, and moFcRH3 are
predicted to encode secreted or type I transmembrane proteins based on their amino
acid sequences. moFcRH1 has two secreted isoforms, both of which have extracellular
(EC) regions of four Ig-like domains with five potential sites for N-linked
glycosylation. One isoform is a fusion protein with a type B scavenger receptor domain
containing 8 cysteines. moFcRH2 has secreted and type I isoforms containing two
Ig-like domains with five N-linked glycosylation sites. The type I isoform has an
uncharged transmembrane region, which the secreted isoform lacks. Both isoforms
contain the cytoplasmic portion, which is long in the transmembrane form and contains
five tyrosines including a consensus sequence for one potential immunoreceptor
tyrosine-based activating motif. moFcRH3 contains five Ig-like domains with six
potential sites of N-linked glycosylation. Its transmembrane domain is also uncharged
and the cytoplasmic region contains one potential ITAM and one potential
immunoreceptor tyrosine-based inhibitory motif. The amino acid (aa) length of
individual regions and full length (FL) isoforms, as well as approximate molecular
weight (MW) in Daltons (Da), is indicated in the structural diagram of Figure 8.

Throughout this application, various publications are referenced. The
disclosures of these publications in their entireties are hereby incorporated by reference
into this application in order to more fully describe the state of the art to which this
invention pertains.

It will be apparent to those skilled in the art that various modifications and
variations can be made in the present invention without departing from the scope or
spirit of the invention. Other embodiments of the invention will be apparent to those
skilled in the art from consideration of the specification and practice of the invention
disclosed herein. It is intended that the specification and examples be considered as
exemplary only, with a true scope and spirit of the invention being indicated by the
following claims.
What is claimed is:



1. An isolated FcRH, comprising a cytoplasmic region with more than 107 or less
than 104 amino acids, a transmembrane region, and an extracellular region.

2. The isolated FcRH of claim 1, wherein the extracellular region comprises less 
than four Ig domains. 

3. The isolated FcRH of claim 2, wherein the cytoplasmic region comprises less
than 104 amino acids.

4. The isolated FcRH of claim 3, wherein the transmembrane region comprises an 
acidic amino acid. 

5. The isolated FcRH of claim 4, wherein the acidic amino acid is glutamate. 

6. The isolated FcRH of claim 2, wherein the cytoplasmic region comprises the
amino acid sequence of SEQ ID NO:1.

7. The isolated FcRH of claim 2, wherein the extracellular region comprises the 
amino acid sequence of SEQ ID NO:21. 

8. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID 
NO:2. 

9. The isolated FcRH of claim 1, wherein the receptor is expressed by myeloid
cells.

10. The isolated FcRH of claim 9, wherein the receptor is expressed by T-cells.

11. A polypeptide comprising the amino acid sequence of SEQ ID NO:1.

12. A polypeptide comprising the amino acid of SEQ ID NO: 1 with conservative 
amino acid substitutions. 

13. A polypeptide comprising the amino acid sequence of SEQ ID NO:21 

14. A polypeptide comprising the amino acid of SEQ ID NO:21 with conservative 
amino acid substitutions. 

15. A polypeptide comprising the amino acid of SEQ ID NO:2. 

16. A polypeptide comprising the amino acid of SEQ ID NO:2 with conservative 
amino acid substitutions. 

17. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises less than
99 amino acids and wherein the receptor further comprises an extracellular region
with up to four Ig domains and up to five N-linked glycosylation sites.

18. The isolated FcRH of claim 17, wherein the cytoplasmic region comprises the
amino acid sequence of SEQ ID NO:3.

19. The isolated FcRH of claim 17, wherein the extracellular region comprises the 
amino acid sequence of SEQ ID NO:22. 

20. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:4.

21. A polypeptide comprising the amino acid sequence of SEQ ID NO:3.

22. A polypeptide comprising the amino acid of SEQ ID NO:3 with conservative 
amino acid substitutions. 

23. A polypeptide comprising the amino acid sequence of SEQ ID NO:22.

24. A polypeptide comprising the amino acid of SEQ ID NO:22 with conservative 
amino acid substitutions. 

25. A polypeptide comprising the amino acid of SEQ ID NO:4. 

26. A polypeptide comprising the amino acid of SEQ ID NO:4 with conservative 
amino acid substitutions. 

27. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises more
than 107 amino acids.

28. The isolated FcRH of claim 27, wherein the cytoplasmic region comprises the
amino acid sequence of SEQ ID NO:5.

29. The isolated FcRH of claim 27, wherein the cytoplasmic region comprises the
amino acid sequence of SEQ ID NO:23.

30. The isolated FcRH of claim 27, wherein the extracellular region comprises the 
amino acid sequence of SEQ ID NO:24. 

31. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:6.

32. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID 
NO:25. 

33. A polypeptide comprising the amino acid sequence of SEQ ID NO:5.

34. A polypeptide comprising the amino acid of SEQ ID NO:5 with conservative
amino acid substitutions.

35. A polypeptide comprising the amino acid sequence of SEQ ID NO:24.

36. A polypeptide comprising the amino acid of SEQ ID NO:24 with conservative
amino acid substitutions.

37. A polypeptide comprising the amino acid sequence of SEQ ID NO:23.

38. A polypeptide comprising the amino acid of SEQ ID NO:23 with conservative 
amino acid substitutions. 

39. A polypeptide comprising the amino acid sequence of SEQ ID NO:6. 

40. A polypeptide comprising the amino acid of SEQ ID NO: 6 with conservative 
amino acid substitutions. 

41. A polypeptide comprising the amino acid sequence of SEQ ID NO:25.

42. A polypeptide comprising the amino acid of SEQ ID NO:25 with conservative 
amino acid substitutions. 

43. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises the
amino acid sequence of SEQ ID NO:26. 

44. The isolated FcRH of claim 1, wherein the extracellular region comprises the 
amino acid sequence of SEQ ID NO:27. 

45. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:28. 

46. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH 
of claim 2. 

47. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:1.

48. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:21. 

49. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:2. 

50. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID 
NO:7. 

51. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:7, or the complement of SEQ ID
NO:7.

52. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID 
NO:13. 

53. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:13 or the complement of SEQ
ID NO:13.

54. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID 
NO:8. 

55. An isolated nucleic acid comprising a sequence that hybridizes under highly 
stringent conditions to a hybridization probe, wherein the hybridization probe 
comprises the nucleotide sequence of SEQ ID NO:8, or the complement of SEQ ID 
NO:8. 

56. A single stranded nucleic acid that hybridizes under stringent conditions to a 
nucleic acid having the sequence of SEQ ID NO:7, SEQ ID NO: 13 or SEQ ID 
NO:8. 

57. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH 
of claim 17. 

58. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:3. 

59. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:22. 

60. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:4. 

61. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID 
NO:9. 

62. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:9, or the complement of SEQ ID
NO:9.

63. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID 
NO: 14. 

64. An isolated nucleic acid comprising a sequence that hybridizes under highly 
stringent conditions to a hybridization probe, wherein the hybridization probe 
comprises the nucleotide sequence of SEQ ID NO: 14, or the complement of SEQ 
ID NO: 14. 

65. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID
NO:10.

66. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe 
comprises the nucleotide sequence of SEQ ID NO: 10, or the complement of SEQ 
ID NO: 10. 

67. A single stranded nucleic acid that hybridizes under stringent conditions to a 
nucleic acid having the sequence of SEQ ID NO:9, SEQ ID NO: 14, or SEQ ID 
NO:10. 

68. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH 
of claim 27. 

69. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:5.

70. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:23. 

71. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:24. 

72. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:6. 

73. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:25. 

74. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:11.

75. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:11, or the complement of SEQ
ID NO:11.

76. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID 
NO: 16. 

77. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:16, or the complement of SEQ
ID NO:16.

78. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:15.

79. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:15, or the complement of SEQ
ID NO:15.

80. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID 
NO: 12. 

81. An isolated nucleic acid comprising a sequence that hybridizes under highly 
stringent conditions to a hybridization probe, wherein the hybridization probe 
comprises the nucleotide sequence of SEQ ID NO: 12, or the complement of SEQ 
ID NO: 12. 

82. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID 
NO:17. 

83. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:11, or the complement of SEQ
ID NO:17.

84. A single stranded nucleic acid that hybridizes under stringent conditions to a
nucleic acid having the sequence of SEQ ID NO:11, SEQ ID NO:15, SEQ ID
NO:16, or SEQ ID NO:12.

85. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:26.

86. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:27. 

87. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID 
NO:28. 

88. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:18, or the complement of SEQ
ID NO:18.

89. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:19, or the complement of SEQ
ID NO:19.

90. An isolated nucleic acid comprising a sequence that hybridizes under highly
stringent conditions to a hybridization probe, wherein the hybridization probe
comprises the nucleotide sequence of SEQ ID NO:20, or the complement of SEQ
ID NO:20.



91. A single stranded nucleic acid that hybridizes under stringent conditions to a
nucleic acid having the sequence of SEQ ID NO: 18, SEQ ID NO: 19, or SEQ ID 
NO:20. 

92. An expression vector comprising the nucleic acid of claim 46 operably linked to an 
expression control sequence. 

93. An isolated cell comprising the vector of claim 92. 

94. A method of making a FcRH, comprising culturing the cell of claim 93 under 
conditions permitting expression of the FcRH. 

95. An expression vector comprising the nucleic acid of claim 57 operably linked to an 
expression control sequence. 

96. An isolated cell comprising the vector of claim 95. 

97. A method of making a FcRH, comprising culturing the cell of claim 96 under 
conditions permitting expression of the FcRH. 

98. An expression vector comprising the nucleic acid of claim 56 operably linked to
an expression control sequence.

99. An isolated cell comprising the vector of claim 98. 

100. A method of making a FcRH, comprising culturing the cell of claim 99 under 
conditions permitting expression of the FcRH. 

101. An expression vector comprising the nucleic acid of claim 57 operably linked to
an expression control sequence.

102. An isolated cell comprising the vector of claim 101.

103. A method of making a FcRH, comprising culturing the cell of claim 102 under
conditions permitting expression of the FcRH.

104. An expression vector comprising the nucleic acid of claim 67 operably linked to 
an expression control sequence. 

105. An isolated cell comprising the vector of claim 104. 

106. A method of making a FcRH, comprising culturing the cell of claim 105 under
conditions permitting expression of the FcRH.

107. An expression vector comprising the nucleic acid of claim 68 operably linked to
an expression control sequence. 

108. An isolated cell comprising the vector of claim 107. 

109. A method of making a FcRH, comprising culturing the cell of claim 108 under 
conditions permitting expression of the FcRH. 

110. An expression vector comprising the nucleic acid of claim 91 operably linked to
an expression control sequence.

111. An isolated cell comprising the vector of claim 110.

112. A method of making a FcRH, comprising culturing the cell of claim 111 under
conditions permitting expression of the FcRH.

113. A purified antibody or immunologic fragment thereof, wherein the antibody
or fragment thereof selectively binds to the FcRH of claim 1.

114. A purified antibody or immunologic fragment thereof, wherein the antibody
or fragment thereof selectively binds to the FcRH of claim 2.

115. A purified antibody or immunologic fragment thereof, wherein the antibody
or fragment thereof selectively binds to the FcRH of claim 17.

116. A purified antibody or immunologic fragment thereof, wherein the antibody
or fragment thereof selectively binds to the FcRH of claim 27.

117. The antibody or fragment of claim 113, wherein the antibody or fragment is a
monoclonal antibody or fragment thereof.

118. The antibody or fragment of claim 113, wherein the antibody or fragment 
thereof is a humanized antibody, a fully human antibody, or a fragment thereof. 

119. The antibody or fragment of claim 113, wherein the antibody or fragment 
thereof is a single chain antibody or fragment thereof. 

120. The antibody or fragment of claim 113, wherein the antibody or fragment 
thereof is labeled. 

121. The antibody or fragment of claim 113, wherein the label is a radiolabel. 

122. The antibody or fragment of claim 113, wherein the antibody or fragment is 
conjugated or fused with a toxin. 

123. A purified antibody that selectively binds to the FcRH of claim 6, but not to the
FcRH of claim 18, 28, or 43.

124. A purified antibody that selectively binds to the FcRH of claim 18, but not to
the FcRH of claim 6, 28, or 43.

125. A purified antibody that selectively binds to the FcRH of claim 28, but not to
the FcRH of claim 6, 18, or 43.

126. The purified antibody of claim 125, wherein the antibody does not bind to the
FcRH of claim 29.

127. A purified antibody that selectively binds to the FcRH of claim 29, but not to
the FcRH of claim 6, 18, or 43.

128. The purified antibody of claim 127, wherein the antibody does not bind to the
FcRH of claim 28.

129. A purified antibody that selectively binds to the FcRH of claim 43, but not to
the FcRH of claim 6, 18, or 28.

130. A purified antibody that selectively binds to the FcRH of claim 7, but not to the 
FcRH of claim 19, 30, or 44. 

131. A purified antibody that selectively binds to the FcRH of claim 19, but not to 
the FcRH of claim 7, 30, or 44. 

132. A purified antibody that selectively binds to the FcRH of claim 30, but not to 
the FcRH of claim 7, 19, or 44. 

133. A purified antibody that selectively binds to the FcRH of claim 44, but not to 
the FcRH of claim 7, 19, or 30. 

134. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject,
comprising:

(a) contacting a biological sample of the subject with the antibody of claim
113 under conditions that allow the antibody to bind to an FcRH in the
biological sample;

(b) detecting the amount or pattern of binding by the antibody,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating a malignancy of hematopoietic cell lineage in
the subject.

135. The method of claim 134, wherein the malignancy of hematopoietic cell lineage 
is a malignancy of B cell lineage. 

136. The method of claim 134, wherein the malignancy of hematopoietic cell lineage 
is a malignancy of T cell lineage. 

137. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:1.

138. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:21. 

139. The method of claim 134, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:2. 

140. The method of claim 134, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:3. 

141. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:22. 

142. The method of claim 134, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:4. 

143. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:5. 

144. The method of claim 134, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:23. 

145. The method of claim 134, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:24. 

146. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:6.

147. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:25.

148. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:26.

149. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:27.

150. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:28.

151. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject,
comprising:

(a) contacting the nucleic acid of claim 56 with a biological sample of the
subject under conditions that allow the nucleic acid to hybridize with an
FcRH in the biological sample;

(b) detecting the amount or pattern of binding by the nucleic acid,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating a malignancy of hematopoietic cell lineage in
the subject.

152. The method of claim 151, wherein the malignancy of hematopoietic cell lineage
is a malignancy of B cell lineage.

153. The method of claim 151, wherein the malignancy of hematopoietic cell lineage 
is a malignancy of T cell lineage. 

154. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, 
comprising: 

(a) contacting the nucleic acid of claim 67 with a biological sample of the 
subject under conditions that allow the nucleic acid to hybridize with the 
biological sample; 

(b) detecting the amount or pattern of binding by the nucleic acid, 
changes in the amount or pattern of binding as compared to binding in a 
control sample indicating a malignancy of hematopoietic cell lineage. 

155. The method of claim 154, wherein the malignancy of hematopoietic cell lineage
is a malignancy of B cell lineage.

156. The method of claim 154, wherein the malignancy of hematopoietic cell lineage
is a malignancy of T cell lineage.

157. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject,
comprising:

(a) contacting the nucleic acid of claim 84 with a biological sample of the 
subject under conditions that allow the nucleic acid to hybridize with the 
biological sample; 

(b) detecting the amount or pattern of binding by the nucleic acid, 
changes in the amount or pattern of binding as compared to binding in a 
control sample indicating a malignancy of hematopoietic cell lineage. 

158. The method of claim 157, wherein the malignancy of hematopoietic cell lineage 
is a malignancy of B cell lineage. 

159. The method of claim 157, wherein the malignancy of hematopoietic cell lineage
is a malignancy of T cell lineage.

160. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject,
comprising:

(a) contacting the nucleic acid of claim 91 with a biological sample of the
subject under conditions that allow the nucleic acid to hybridize with the
biological sample;

(b) detecting the amount or pattern of binding by the nucleic acid,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating a malignancy of hematopoietic cell lineage.

161. The method of claim 160, wherein the malignancy of hematopoietic cell lineage
is a malignancy of B cell lineage.

162. The method of claim 160, wherein the malignancy of hematopoietic cell lineage
is a malignancy of T cell lineage.

163. A method of treating a malignancy of hematopoietic cell lineage in a subject, 
comprising contacting the subject's malignant cells with a therapeutically effective 
amount of the antibody of claim 113. 

164. The method of claim 163, wherein the malignancy of hematopoietic cell lineage
is a malignancy of B cell lineage.

165. The method of claim 163, wherein the malignancy of hematopoietic cell lineage
is a malignancy of T cell lineage.

166. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:1.

167. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:21.

168. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:2. 

169. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:3.

170. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:22. 

171. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:4.

172. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:5.

173. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:23.

174. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:24.

175. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:6.

176. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:25. 

177. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:26. 

178. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:27. 

179. The method of claim 163, wherein the antibody selectively binds an FcRH 
having the amino acid sequence of SEQ ID NO:28. 

180. A method of treating a malignancy of hematopoietic cell lineage in a subject,
comprising contacting the subject's malignant cells with a therapeutically effective 
amount of the nucleic acid of claim 56. 

181. A method of treating a malignancy of hematopoietic cell lineage in a subject, 
comprising contacting the subject's malignant cells with a therapeutically effective 
amount of the nucleic acid of claim 67.

182. A method of treating a malignancy of hematopoietic cell lineage in a subject, 
comprising contacting the subject's malignant cells with a therapeutically effective 
amount of the nucleic acid of claim 84. 

183. A method of treating a malignancy of hematopoietic cell lineage in a subject, 
comprising contacting the subject's malignant cells with a therapeutically effective 
amount of the nucleic acid of claim 91.

184. A method of diagnosing an autoimmune disease in a subject, comprising:

(a) contacting a biological sample of the subject with the antibody of claim
113 under conditions that allow the antibody to bind to FcRH in the
biological sample;

(b) detecting the amount or pattern of binding by the antibody,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating an autoimmune disease in the subject.

185. A method of diagnosing an autoimmune disease in a subject, comprising:

(a) contacting the nucleic acid of claim 56 with a biological sample of the
subject under conditions that allow the nucleic acid to hybridize with the
biological sample;

(b) detecting the amount or pattern of binding by the nucleic acid,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating an autoimmune disease.

186. A method of diagnosing an autoimmune disease in a subject, comprising:

(a) contacting the nucleic acid of claim 67 with a biological sample of the
subject under conditions that allow the nucleic acid to hybridize with the
biological sample;

(b) detecting the amount or pattern of binding by the nucleic acid,
changes in the amount or pattern of binding as compared to binding in a
control sample indicating an autoimmune disease.

187. A method of diagnosing an autoimmune disease in a subject, comprising:

(a) contacting the nucleic acid of claim 84 with a biological sample of the 
subject under conditions that allow the nucleic acid to hybridize with the 
biological sample; 

(b) detecting the amount or pattern of binding by the nucleic acid, 
changes in the amount or pattern of binding as compared to binding in a 
control sample indicating an autoimmune disease. 

188. A method of diagnosing an autoimmune disease in a subject, comprising: 

(a) contacting the nucleic acid of claim 91 with a biological sample of the
subject under conditions that allow the nucleic acid to hybridize with the
biological sample;

(b) detecting the amount or pattern of binding by the nucleic acid, 
changes in the amount or pattern of binding as compared to binding in a 
control sample indicating an autoimmune disease. 

189. A method of treating an autoimmune disease in a subject, comprising 
contacting, with a therapeutically effective amount of the antibody of claim 113, 
one or more FcRH expressing cells of the subject. 

190. A method of treating an autoimmune disease in a subject, comprising 
contacting, with a therapeutically effective amount of the nucleic acid of claim 56,
FcRH expressing cells of the subject.

191. A method of treating an autoimmune disease in a subject, comprising 
contacting, with a therapeutically effective amount of the nucleic acid of claim 67, 
FcRH expressing cells of the subject. 

192. A method of treating an autoimmune disease in a subject, comprising 
contacting, with a therapeutically effective amount of the nucleic acid of claim 84, 
FcRH expressing cells of the subject. 

193. A method of treating an autoimmune disease in a subject, comprising
contacting, with a therapeutically effective amount of the nucleic acid of claim 91,
FcRH expressing cells of the subject.

194. A method of modulating a humoral immune response in a subject, comprising
administering to the subject the isolated FcRH of claim 1.

195. A method of modulating a humoral immune response in a subject, comprising 
administering to the subject the antibody of claim 113. 

196. A method of modulating a humoral immune response in a subject, comprising 
administering to the subject the nucleic acid of claim 56. 

197. A method of modulating a humoral immune response in a subject, comprising
administering to the subject the nucleic acid of claim 67. 

198. A method of modulating a humoral immune response in a subject, comprising 
administering to the subject the nucleic acid of claim 84. 

199. A method of modulating a humoral immune response in a subject, comprising 
administering to the subject the nucleic acid of claim 91.

200. An isolated mouse FcRH isoform of FcRH1, wherein the isoform lacks a
cytoplasmic region.

201. A polypeptide comprising the amino acid sequence of SEQ ID NO:70.

202. A polypeptide comprising the amino acid of SEQ ID NO:70 with conservative
amino acid substitutions.

203. An isolated mouse FcRH isoform of FcRH2, wherein the FcRH lacks a
transmembrane region. 

204. A polypeptide comprising the amino acid sequence of SEQ ID NO:73. 

205. A polypeptide comprising the amino acid of SEQ ID NO:73 with conservative
amino acid substitutions.

206. A polypeptide comprising the amino acid of SEQ ID NO:77.

207. A polypeptide comprising the amino acid of SEQ ID NO:77 with conservative
amino acid substitutions.

208. A polypeptide comprising the amino acid sequence of SEQ ID NO:78. 

209. A polypeptide comprising the amino acid sequence of SEQ ID NO:78 with 
conservative amino acid substitutions. 

210. A nucleic acid encoding the isolated mouse FcRH isoform of claim 200. 

211. A nucleic acid encoding the isolated mouse FcRH isoform of claim 203.

212. A nucleic acid encoding the polypeptide of claim 201.

213. A nucleic acid encoding the polypeptide of claim 202. 

214. A nucleic acid encoding the polypeptide of claim 204. 

215. A nucleic acid encoding the polypeptide of claim 205. 

216. A nucleic acid encoding the polypeptide of claim 206. 

217. A nucleic acid encoding the polypeptide of claim 207.

218. A nucleic acid encoding the polypeptide of claim 208. 

219. A nucleic acid encoding the polypeptide of claim 209.

WO 03/089624    PCT/US03/09600

SEQUENCE LISTING

<110> The UAB Research Foundation 
Davis, Randall S. 
Cooper, Max D. 

<120> MEMBERS OF THE FC RECEPTOR HOMOLOG GENE FAMILY (FCRH1-3, 6), 
RELATED REAGENTS, AND USES THEREOF

<130> 21085.0037P1

<141> 2003-03-25 

<150> US 60/367,667 

<151> 2002-03-25 

<160> 102 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 99 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 1 



Lys Arg Lys Ile Gly Arg Arg Ser Ala Arg Asp Pro Leu Arg Ser Leu
1               5                   10                  15

Pro Ser Pro Leu Pro Gln Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro
            20                  25                  30

Gly Gln Leu Gln Pro Ile Tyr Glu Asn Val Asn Val Val Ser Gly Asp
        35                  40                  45

Glu Val Tyr Ser Leu Ala Tyr Tyr Asn Gln Pro Glu Gln Glu Ser Val
    50                  55                  60

Ala Ala Glu Thr Leu Gly Thr His Met Glu Asp Lys Val Ser Leu Asp
65                  70                  75                  80

Ile Tyr Ser Arg Leu Arg Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu
                85                  90                  95

Asp Ala Met

<210> 2 
<211> 413 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 2 

Ala Glu Leu Phe Leu Ile Ala Ser Pro Ser His Pro Thr Glu Gly Ser
1               5                   10                  15

Pro Val Thr Leu Thr Cys Lys Met Pro Phe Leu Gln Ser Ser Asp Ala
            20                  25                  30

Gln Phe Gln Phe Cys Phe Phe Arg Asp Thr Arg Ala Leu Gly Pro Gly
        35                  40                  45

Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp
    50                  55                  60

Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu
65                  70                  75                  80

Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp
                85                  90                  95

Val Ser Leu Glu Thr Gln Pro Pro Gly Gly Gln Val Met Glu Gly Asp
            100                 105                 110

Arg Leu Val Leu Ile Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr
        115                 120                 125

Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr
    130                 135                 140

Gln Arg Ser Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser
145                 150                 155                 160

Asp Ala Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser
                165                 170                 175

Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro
            180                 185                 190

Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val
        195                 200                 205

Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr
    210                 215                 220

Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser
225                 230                 235                 240

Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly
                245                 250                 255

Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu
            260                 265                 270

Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His
        275                 280                 285

Leu Thr Ser Gly Val Ile Glu Gly Leu Leu Ser Thr Leu Gly Pro Ala
    290                 295                 300

Thr Val Ala Leu Leu Phe Cys Tyr Gly Leu Lys Arg Lys Ile Gly Arg
305                 310                 315                 320

Arg Ser Ala Arg Asp Pro Leu Arg Ser Leu Pro Ser Pro Leu Pro Gln
                325                 330                 335

Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile
            340                 345                 350

Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu Ala
        355                 360                 365

Tyr Tyr Asn Gln Pro Glu Gln Glu Ser Val Ala Ala Glu Thr Leu Gly
    370                 375                 380

Thr His Met Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg
385                 390                 395                 400

Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met
                405                 410

<210> 3
<211> 86
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 3

His Lys Ile Ser Gly Glu Ser Ser Ala Thr Asn Glu Pro Arg Gly Ala
1               5                   10                  15

Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser Ser Pro Thr Pro Asp
            20                  25                  30

Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val Gly Ser Val Asp Val
        35                  40                  45

Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro Glu Ser Ser
    50                  55                  60

Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln Val Ile Tyr
65                  70                  75                  80

Ser Ser Val Lys Lys Ser
                85

<210> 4 
<211> 489 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 4 

Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val
1               5                   10                  15

Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Ile Gln Lys Met Ala Tyr
            20                  25                  30

His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe
        35                  40                  45

Leu Ile Gln Ser Ala Val Leu Ser Asp Ser Gly Asn Tyr Phe Cys Ser
    50                  55                  60

Thr Lys Gly Gln Leu Phe Leu Trp Ti\/ Q ± ii j.  Asn Ile Val Lys
65                  70                  75                  80

Ile Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser
                85                  90                  95

Phe Gln Pro Ile Glu Gly Gly Pro Val Ser Leu Lys Cys Glu Thr Arg
            100                 105                 110

Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg
        115                 120                 125

Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln
    130                 135                 140

Ile Ser Ala Val Trp Ser Glu Asp Thr Gly Ser Tyr Trp Cys Lys Ala
145                 150                 155                 160

Glu Thr Val Thr His Arg Ile Arg Lys Gln Ser Leu Gln Ser Gln Ile
                165                 170                 175

His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala
            180                 185                 190

Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser
        195                 200                 205

Val Ala Gly Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala
    210                 215                 220

Thr Gly Thr Ser Met Gly Lys Lys Thr Gln Arg Ser Leu Ser Ala Glu
225                 230                 235                 240

Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys
                245                 250                 255

Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile
            260                 265                 270

Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro
        275                 280                 285

Gly Ala Gln Ala Ala Val Gly Asp Leu Leu Glu Leu His Cys Glu Ala
    290                 295                 300

Leu Arg Gly Ser Pro Pro Ile Leu Tyr Gln Phe Tyr His Glu Asp Val
305                 310                 315                 320

Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn
                325                 330                 335

Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn
            340                 345                 350

Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser
        355                 360                 365

Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala Gly Val Leu Trp
    370                 375                 380

Gly Leu Phe Gly Val Leu Gly Phe Thr Gly Val Ala Leu Leu Leu Tyr
385                 390                 395                 400

Ala Leu Phe His Lys Ile Ser Gly Glu Ser Ser Ala Thr Asn Glu Pro
                405                 410                 415

Arg Gly Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser Ser Pro
            420                 425                 430

Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val Gly Ser
        435                 440                 445

Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro
    450                 455                 460

Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln
465                 470                 475                 480

Val Ile Tyr Ser Ser Val Lys Lys Ser
                485

<210> 5
<211> 140
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 5

His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly
1               5                   10                  15

Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg
            20                  25                  30

Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala
        35                  40                  45

Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp Ser
    50                  55                  60

Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn
65                  70                  75                  80

Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val
                85                  90                  95

Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu
            100                 105                 110

Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr
        115                 120                 125

Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His
    130                 135                 140

<210> 6
<211> 717
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 6



uj. y 


Va 1 
v di 


Ala 
fix. a 


IT X LJ 




Ala 


Va 1 

V Cll 


T»»i i 

XJC u. 


Tiiai l 


Ti^i i 
lie u 


1 








5 










10 

X \J 


ai a 

nla. 


IT lie 


T.\7 c 

Jjy lb 


v^iy 


ulu 


jjys 


V cil 


Ala 
nla 


licU 


lie 








Z \J 










9 ^ 




Ocl 


Lie la 


Ai a 


pi n 


biy 


A cn 
nop 


l ill 


iyr 


lip 


iyr 






35 










40 






uy o 


Tip 
lie 


Liys 


nio 


A <5TN 


uy o 


Tl p 
lie 


Rl n 

vJlll 


lie 


±ili 




50 

L/ 










55 








oys 


jjys 


1 xlx. 




uiy 


OCX 


oer 


jjeu 


JCl 


T\ QV-V 












1C\ 
I u 










C q 
Oci. 


JT X. LJ 


A cn 
nop 


lip 


XJC LI 


Tip 
lie 


i!CU 


bill 


A T a 
rild 


JjcU 






















ASp 


noil 


V a.1 


lie 


jueu 


TV V>rT 

Arg 


Pirn 

bys 


bin 


pi,, 

bly 


Lys 








1 u u 










i nt% 

1U0 




bin 


juys 


\7a T 
Val 


rpi tv- 
il yr 


iyr 


X t f r* 

i»ys 


Asp 


bly 


jjys 


bin 






i i c: 
113 










i 9n 

1ZU 








pi ii 


I tip 

ijys 


lie 


inr 


Val 


Asn 


oer 


T7_. 1 

vai 


O /-k ■v 

oer 




i ^n 

1 o U 










Ijj 








Hi cs 


bys 


rp V-i -u- 

i nr 


Hid. 


iyr 


Arg 


X t t m 

i«ys 


r>u a 
rile 


i yr 


lie 


1 ft O 










IjU 










Dei 


1 "t T C* 

hys 


Fro 


jjeu 


Asn 


TIQ 

l le 


bin 


vai 


pi „ 
bin 


blU 










1DO 










i 7 n 
1 / u 


jjeu 


Arg 


7\ 1 -a 

Aia 


oer 


C ^ -v- 

oer 


O a V 

oer 


inr 


v> y-v 

Fro 


Ti- 
ne 


pi 

blU 








i on 
i o u 










1 QC 

1 bD 




lux 


cys 


Pin 

blU 


i nr 


bin 


jjeu 


O -v* 

oer 


v» /-v 

Fro 


bin 


7V -tp pr 

Arg 






1 QC, 










o nn 
zuu 






Fne 


oer 


lieu 


Jrlie 


7V "v^ 

Arg 


ASp 


oer 


bin 


rn 1— v 

inr 


Leu 




91 n 

Z1U 










oi c 
£lj 








oer 


Fro 


7\ **** p*r 

Arg 


Leu 


bin 


lie 


V"j -w— >s 

Fro 


7\ "1 _ 

Ala 


Met: 


Trp 


9 o c 

Z Z O 




















iyr 


irp 


Cys 




val 


blU 


inr 


vai 


rp V-. -w^ 

Inr 


T_T ' _ 

nlS 










^ fi O 










ocn 

zo u 


jjeu 


7\ -v- pv 

Arg 


O P"V V" 

ber 


bin 


lie 


Arg 


val 


bin 


Arg 


Val 


















z bo 




.us u 


Pill 


lie 


T\ v* 

Arg 


rro 


inr 


biy 


Gly 


bin 


Leu 






9 7 R 

Z 1 D 










? Pin 






vai 


jjeu 


ne 


Cys 


C? y-^ in 

oer 


vai 


Ala 


pi _ 

bin 


bly 


O P^ 

oer 




z u 


















irp 


Hi o 

nlS 


X t r £i 

l»ys 


blU 


Gly 


Arg 


vai 


7\ V y-^V 

Arg 


O V* 

oer 


Jjeu 












OlU 












lieu 


±jeu 


Aia 


blU 


lieu 


illS 


vai 


jjeu 


rn t#» 

inr 










J ^ o 












p.1 \7 

oiy 


n.1 y 


iyr 


iyr 


bys 


Aia 


Aia 


ASp 


Asn 


T7 3 1 

vai 






















TVi y* 


lip 


Tl a 
11c 




V Cll 


1 liX 


Va 1 
v ell 


nig 


Tin 

ne 


Jrro 






»J 
















Thr 


Phe 


Arrr 
ax y 


A 1 ^ 


JT 1 \J 


A r*rr 
rlx y 


AT a 
nlo 


Hi c; 
ill o 


i in 


Va 1 
V cll 




370 


















XJC LA 


Hi q 

All O 


^ys 


ul U. 


oexv 


XJCU 


ril y 


PI , 7 
bly 


oer 


Fro 






















Tyr 


ills 


blU 


T\ -v-s 

ASp 


val 


Tnr 


Leu 


Gly 


Asn 


Ser 










405 










410 


Gly 


Ala 


Ser 


Phe 


Asn 


Leu 


Ser 


Leu 


Thr 


Ala 








420 










425 




Ser 


Cys 


Asp 


Ala 


Asp 


Asn 


Gly 


Leu 


Gly 


Ala 






435 










440 






Ser 


Leu 


Arg 


Val 


Thr 


Val 


Pro 


Val 


Ser 


Arg 




450 










455 







*
A «*n

noil 


riu 


Prn 

C 1 \J 


Trn 
up 


OCX 

15 

1 >J 


Th r 

x i ix 


v»y o 


oci 


Gar 

oci 


Tip 

11C 

30 


cpr 

OCX 


Hiq 
m o 


Hi q 

Xi 1 o 




\J X LI 

45 


uy o 


XJC u 


xjc ix 


VJl IX 


60 




A <?n 
noil 


iyi 


GJ n 

UIH 


Ala 


Va 1 

V Cll 


Hi q 
nio 


Va 1 
v ai 


ul u 


Php 
it lie 


75 










80 


Hi «? 

ill o 


r i u 


Va 1 

V cll 


IT lie 


m n 

VJl LI 


oiy 


riop 


noil 


T.i/e 


noil 

110 


Xill 


Hi q 

ill o 


JjcU 


Fro 


Asn 
125 


Q 

Oci 


iyr 


Asn 


Arg 


ASp 

140 


nSU 


oer 


T T f P 

jjys 


iyr 


iieu 


nSp 


ne 


pi n 

VjIU 


\7a 1 

va.1 


inr 


155 










160 


lieu 


Fne 


lieu 


Hi c 
nl S 


Fro 

1 / J 


v ai 


biy 


oer 


Fro 


ixieu 
190 


inr 


jjeu 


Fro 


ASp 


\7a 1 

vai 
205 


pi n 
bin 


jjeu 


P 1 Tl 

bin 


Giy 


lieu 
220 


pi 
biy 


Trp 


oer 


TV -v* r*r 

Arg 


mr 


pi 

blU 


ASp 


oer 


pi 
bly 


O P, V* 

oer 


235 










240 


oer 


lie 


Jby s 


jjys 


TV v p< 

Arg 

9 


oer 


ID s~\ 

rro 


vai 


oer 


Asn 
270 


vai 


Asn 


Tl e 

lie 


blU 


p i 
biy 

285 


r*l ii 
blU 


Asn 


Met. 


biy 


inr 
300 


T 7 -j 1 
V Cll 


inr 


Fne 


O q v 

oer 


biy 


7\ r~t 

Arg 


jjys 


inr 


bin 


TV "V* /~T 

Arg 


315 










320 


V cll 


jjys 


Pin 

blu 


Oci 


7\ o-r\ 

nop 
JJJ 


7\ 1 -a 

Aia 


Hie; 

ni o 


Car 
Oci 


Jrl u 


Tip 

lie 

350 


JjcU 


Oci 


Va 1 

V Cll 


Oci 


Hi q 

ill o 

365 


Pro 

XT 1 VJ 


\7a 1 
V cil 


JjcU 


Va 1 
vai 


bxy 

380 


A cn 
ASp 


T>oii 

DC u 


XJCLl 


GT 11 
ul U 


t 1 u 


Tic, 
11c 


XJCLl 


iyr 


ni y 


Php 

XT IlC 


•J Z> *J 










400 

*i \J \J 


Ser 


Ala 


Pro 


Ser 


Gly 
415 


Gly 


Glu 


His 


Ser 


Gly 
430 


Asn 


Tyr 


Gin 


His 


Ser 
445 


His 


Gly 


Val 


Pro 


Val 


Leu 


Thr 


Leu 


Arg 



460

Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys
465                 470                 475                 480

Glu Ser Leu Arg Gly Ser Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu
                485                 490                 495

Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser
            500                 505                 510

Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu
        515                 520                 525

Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn
    530                 535                 540

Val Thr Gly Thr Ser Arg Asn Arg Thr Gly Leu Thr Ala Ala Gly Ile
545                 550                 555                 560

Thr Gly Leu Val Leu Ser Ile Leu Val Leu Ala Ala Ala Ala Ala Leu
                565                 570                 575

Leu His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr
            580                 585                 590

Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser
        595                 600                 605

Arg Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu
    610                 615                 620

Ala Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp
625                 630                 635                 640

Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu
                645                 650                 655

Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr
            660                 665                 670

Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly
        675                 680                 685

Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn
    690                 695                 700

Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His
705                 710                 715

<210> 7
<211> 300
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 7

aaaagaaaaa taggaagacg ttcagccagg gatccactca ggagccttcc cagccctcta 60

ccccaagagt tcacctacct caactcacct accccagggc agctacagcc tatatatgaa 120

aatgtgaatg ttgtaagtgg ggatgaggtt tattcactgg cgtactataa ccagccggag 180

caggaatcag tagcagcaga aaccctgggg acacatatgg aggacaaggt ttccttagac 240

atctattcca ggctgaggaa agcaaacatt acagatgtgg actatgaaga tgctatgtaa 300

<210> 8
<211> 2038
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 8

clcgactctg aggtgcattc tttttttgat gagaggcatc tctaggtacc atccctgacc 60

tggtcctcat gctgccgagg ctgttgctgt tgatctgtgc tccactctgt gaacctgccg 120

agctgttttt gatagccagc ccctcccatc ccacagaggg gagcccagtg accctgacgt 180

gtaagatgcc ctttctacag agttcagatg cccagttcca gttctgcttt ttcagagaca 240

cccgggcctt gggcccaggc tggagcagct cccccaagct ccagatcgct gccatgtgga 300 

aagaagacac agggtcatac tggtgcgagg cacagacaat ggcgtccaaa gtcttgagga 360

gcaggagatc ccagataaat gtgcacaggg tccctgtcgc tgatgtgagc ttggagactc 420 

agcccccagg aggacaggtg atggagggag acaggctggt cctcatctgc tcagttgcta 480 

tgggcacagg agacatcacc ttcctttggt acaaaggggc tgtaggttta aaccttcagt 540 

caaagaccca gcgttcactg acagcagagt atgagattcc ttcagtgagg gagagtgatg 600 

ctgagcaata ttactgtgta gctgaaaatg gctatggtcc cagccccagt gggctggtga 660 

gcatcactgt cagaatcccg gtgtctcgcc caatcctcat gctcagggct cccagggccc 720

aggctgcagt ggaggatgtg ctggagcttc actgtgaggc cctgagaggc tctcctccaa 780

tcctgtactg gttttatcac gaggatatca ccctggggag caggtcggcc ccctctggag 840 

gaggagcctc cttcaacctt tccctgactg aagaacattc tggaaactac tcctgtgagg 900 

ccaacaatgg cctgggggcc cagcgcagtg aggcggtgac actcaacttc acagtgccta 960 

ctggggccag aagcaatcat cttacctcag gagtcattga ggggctgctc agcacccttg 1020 

gtccagccac cgtggcctta ttattttgct acggcctcaa aagaaaaata ggaagacgtt 1080 

cagccaggga tccactcagg agccttccca gccctctacc ccaagagttc acctacctca 1140 

actcacctac cccagggcag ctacagccta tatatgaaaa tgtgaatgtt gtaagtgggg 1200 

atgaggttta ttcactggcg tactataacc agccggagca ggaatcagta gcagcagaaa 1260 

ccctggggac acatatggag gacaaggttt ccttagacat ctattccagg ctgaggaaag 1320 

caaacattac agatgtggac tatgaagatg ctatgtaagg ttatggaaga ttctgctctt 1380 

tgaaaaccat ccatgacccc aagcctcagg cctgatatgt tcttcagaga tcctggggca 1440

ttagctttcc agtatacctc ttctggatgc cattctccat ggcactattc cttcatctac 1500 

tgtgaagtga agttggcgca gccctgaaga aactacctag gagaactaat agacacagga 1560 

gtgacaggga ctttgttatc agaaccagat tcctgccggc tcctttgaaa acaggtcata 1620 

ttgtgctctt ctgtttacaa gaggaaacaa gatggaataa aagaaattgg gatcttgggt 1680 

tggagggaca gtgaagctta gagcacatga actcaaggtt agtgactctg caggacttca 1740

cagagagagc tgtgcccatc attcagtcca agtgctttct ctgcccagac agcacagaac 1800 

tccagccccg ctacttacat ggatcatcga gtttccacct aaaatatgat tctatttatt 1860 

ttgagtcact gttaccaaat tagaactaaa acaaagttac ataaaaagtt attgtgactc 1920 

cacttaattt tagtgacgta tttttgtata tataggccaa cctataccac atccaaaatt 1980 

atgtatctat tacagcccct agaagcttta taaatacagt gtgtcttctt ttattcac 2038 

<210> 9 
<211> 261 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 9 

cacaagatat caggagaaag ttctgccact aatgaaccca gaggggcttc caggccaaat 60 

cctcaagagt tcacctattc aagcccaacc ccagacatgg aggagctgca gccagtgtat 120 

gtcaatgtgg gctctgtaga tgtggatgtg gtttattctc aggtctggag catgcagcag 180 

ccagaaagct cagcaaacat caggacactt ctggagaaca aggactccca agtcatctac 240 

tcttctgtga agaaatcata a 261 

<210> 10 
<211> 2573 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 10 

ggtgaccaag agtacatctc ttttcaaata gctggattag gtcctcatgc tgctgtggtc 60 

attgctggtc atctttgatg cagtcactga acaggcagat tcgctgaccc ttgtggcgcc 120 
ctcttctgtc ttcgaaggag acagcatcgt tctgaaatgc cagggagaac agaactggaa 180

aattcagaag atggcttacc ataaggataa caaagagtta tctgttttca aaaaattctc 240

agatttcctt atccaaagtg cagttttaag tgacagtggt aactatttct gtagtaccaa 300

aggacaactc tttctctggg ataaaacttc aaatatagta aagataaaag tccaagagct 360

ctttcaacgt cctgtgctga ctgccagctc cttccagccc atcgaagggg gtccagtgag 420

cctgaaatgt gagacccggc tctctccaca gaggttggat gttcaactcc agttctgctt 480

cttcagagaa aaccaggtcc tggggtcagg ctggagcagc tctccggagc tccagatttc 540

tgccgtgtgg agtgaagaca cagggtctta ctggtgcaag gcagaaacgg tgactcacag 600

gatcagaaaa cagagcctcc aatcccagat tcacgtgcag agaatcccca tctctaatgt 660

aagcttggag atccgggccc ccgggggaca ggtgactgaa ggacaaaaac tgatcctgct 720

ctgctcagtg gctgggggta caggaaatgt cacattctcc tggtacagag aggccacagg 780

aaccagtatg ggaaagaaaa cccagcgttc cctgtcagca gagctggaga tcccagctgt 840

gaaagagagt gatgccggca aatattactg tagagctgac aacggccatg tgcctatcca 900

gagcaaggtg gtgaatatcc ctgtgagaat tccagtgtct cgccctgtcc tcaccctcag 960

gtctcctggg gcccaggctg cagtggggga cctgctggag cttcactgtg aggccctgag 1020

aggctctccc ccaatcttgt accaatttta tcatgaggat gtcacccttg ggaacagctc 1080

ggccccctct ggaggagggg cctccttcaa cctctctttg actgcagaac attctggaaa 1140

ctactcctgt gaggccaaca acggcctggg ggcccagtgc agtgaggcag tgccagtctc 1200

catctcagga cctgatggct atagaagaga cctcatgaca gctggagttc tctggggact 1260

gtttggtgtc cttggtttca ctggtgttgc tttgctgttg tatgccttgt tccacaagat 1320

atcaggagaa agttctgcca ctaatgaacc cagaggggct tccaggccaa atcctcaaga 1380

gttcacctat tcaagcccaa ccccagacat ggaggagctg cagccagtgt atgtcaatgt 1440

gggctctgta gatgtggatg tggtttattc tcaggtctgg agcatgcagc agccagaaag 1500

ctcagcaaac atcaggacac ttctggagaa caaggactcc caagtcatct actcttctgt 1560

gaagaaatca taacacttgg aggaatcaga agggaagatc aacagcaagg atggggcatc 1620

attaagactt gctataaaac cttatgaaaa tgcttgaggc ttatcacctg ccacagccag 1680

aacgtgcctc aggaggcacc tcctgtcatt tttgtcctga tgatgtttct tctccaatat 1740

cttcttttac ctatcaatat tcattgaact gctgctacat ccagacactg tgcaaataaa 1800

ttatttctgc taccttctct taagcaatca gtgtgtaaag atttgaggga agaatgaata 1860

agagatacaa ggtctcacct tcatctactg tgaagtgatg agaacaggac ttgatagtgg 1920

tgtattaact tatttatgtg ctgctggata cagtttgcta atattttgtt gagaattttt 1980

gcaaatatgt tcattgggaa tattggcctg aaattttctt ttccactgtg tctctgccag 2040

aatgtttgta tcaggctgat gctggcttca tagaatgagt taggcaggag cccttcctcc 2100

ttgatttttt ggcatagttt cagcaggatt ggtaccagtt attctttctg ^atcttgtag 2160

aattcagcta tgaatccatc tggtctaggg cttttgtgtt ggttggtaag "ttttatta 2220

ctaattcaac ttcagcgctt gatattggtc taggaggggt ttctgtctct tcctggttca 2280

atcttgggag attgtgtgtt tccaggaatt tagccgtttc ctccagattt tcttctttat 2340

gtgcatcgac ttgagtgtaa acataactta tatgcactgg gaaaccaaaa aatctgtgtg 2400

acttgcttta ttgcagcatt tgttttattt tggtagtctg gaactgaacc tgcaatatca 2460

ccaaagtatg catatagttg caaaaatgtg atttttgaca tagtaaatat gagtatttgc 2520

aataaactat gatattactt ttgtaagtat atagaataaa atgtaaataa tct 2573

<210> 11
<211> 423
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 11

cattacgcca gggcccgaag gaaaccagga ggactttctg ccactggaac atctagtcac 60

agtcctagcg agtgtcagga gccttcctcg tccaggcctt ccaggataga ccctcaagag 120

cccactcact ctaaaccact agccccaatg gagctggagc caatgtacag caatgtaaat 180

cctggagata gcaacccgat ttattcccag atctggagca tccagcatac aaaagaaaac 240

tcagctaatt gtccaatgat gcatcaagag catgaggaac ttacagtcct ctattcagaa 300

ctgaagaaga cacacccaga cgactctgca ggggaggcta gcagcagagg cagggcccat 360

gaagaagatg atgaagaaaa ctatgagaat gtaccacgtg tattactggc ctcagaccac 420

tag 423

<210> 12
<211> 2416
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 12

gtctcatctg agtagcagct tcctgccctc cttcttggag ataagtcggg cttttggtga 60

gacagacttt cccaaccctc tgcccggccg gtgcccatgc ttctgtggct gctgctgctg 120

atcctgactc ctggaagaga acaatcaggg gtggccccaa aagctgtact tctcctcaat 180

cctccatggt ccacagcctt caaaggagaa aaagtggctc tcatatgcag cagcatatca 240

cattccctag cccagggaga cacatattgg tatcacgatg agaagttgtt gaaaataaaa 300

catgacaaga tccaaattac agagcctgga aattaccaat gtaagacccg aggatcctcc 360

ctcagtgatg ccgtgcatgt ggaattttca cccgactggc tgatcctgca ggctttacat 420

cctgtctttg aaggagacaa tgtcattctg agatgtcagg ggaaagacaa caaaaacact 480

catcaaaagg tttactacaa ggatggaaaa cagcttccta atagttataa tttagagaag 540

atcacagtga attcagtctc cagggataat agcaaatatc attgtactgc ttataggaag 600

ttttacatac ttgacattga agtaacttca aaacccctaa atatccaagt tcaagagctg 660

tttctacatc ctgtgctgag agccagctct tccacgccca tagaggggag tcccatgacc 720

ctgacctgtg agacccagct ctctccacag aggccagatg tccagctgca attctccctc 780

ttcagagata gccagaccct cggattgggc tggagcaggt cccccagact ccagatccct 840

gccatgtgga ctgaagactc agggtcttac tggtgtgagg tggagacagt gactcacagc 900

atcaaaaaaa ggagcctgag atctcagata cgtgtacaga gagtccctgt gtctaatgtg 960

aatctagaga tccggcccac cggagggcag ctgattgaag gagaaaatat ggtccttatt 1020

tgctcagtag cccagggttc agggactgtc acattctcct ggcacaaaga aggaagagta 1080

agaagcctgg gtagaaagac ccagcgttcc ctgttggcag agctgcatgt tctcaccgtg 1140

aaggagagtg atgcagggag atactactgt gcagctgata acgttcacag ccccatcctc 1200

agcacgtgga ttcgagtcac cgtgagaatt ccggtatctc accctgtcct caccttcagg 1260

gctcccaggg cccacactgt ggtgggggac ctgctggagc ttcactgtga gtccctgaga 1320

ggctctcccc cgatcctgta ccgattttat catgaggacg tcaccctggg gaacagctca 1380

gccccctctg gaggaggagc ctccttcaac ctctctctga ctgcagaaca ttctggaaac 1440

tactcctgtg atgcagacaa tggcctgggg gcccagcaca gtcatggagt gagtctcagg 1500

gtcacagttc cggtgtctcg ccccgtcctc accctcaggg ctcccggggc ccaggctgtg 1560

gtgggggacc tgctggagct tcactgtgag tccctgagag gctccttccc gatcctgtac 1620

tggttttatc acgaggatga caccttgggg aacatctcgg cccactctgg aggaggggca 1680

tccttcaacc tctctctgac tacagaacat tctggaaact actcatgtga ggctgacaat 1740

ggcctggggg cccagcacag taaagtggtg acactcaatg ttacaggaac ttccaggaac 1800

agaacaggcc ttaccgctgc gggaatcacg gggctggtgc tcagcatcct cgtccttgct 1860

gctgctgctg ctctgctgca ttacgccagg gcccgaagga aaccaggagg actttctgcc 1920

actggaacat ctagtcacag tcctagcgag tgtcaggagc cttcctcgtc caggccttcc 1980

aggatagacc ctcaagagcc cactcactct aaaccactag ccccaatgga gctggagcca 2040

atgtacagca atgcaaatcc tggagatagc aacccgattt attcccagat ctggagcatc 2100

cagcatacaa aagaaaactc agctaattgt ccaatgatgc atcaagagca tgaggaactt 2160

acagtcctct attcagaact gaagaagaca cacccagacg actctgcagg ggaggctagc 2220

agcagaggca gggcccatga agaagatgat gaagaaaact atgagaatgt accacgtgta 2280

ttactggcct cagaccacta gccccttacc cagagtggcc cacaggaaac agcctgcacc 2340

attttttttt ctgttctctc caaccacaca tcatccatct ctccagactc tgcctcctac 2400

gaggctgggc tgcagg 2416



<210> 13 
<211> 873 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 13

gagctgtttt tgatagccag cccctcccat cccacagagg ggagcccagt gaccctgacg 60

tgtaagatgc cctttctaca gagttcagat gcccagttcc agttctgctt tttcagagac 120

acccgggcct tgggcccagg ctggagcagc tcccccaagc tccagatcgc tgccatgtgg 180

aaagaagaca cagggtcata ctggtgcgag gcacagacaa tggcgtccaa agtcttgagg 240

agcaggagat cccagataaa tgtgcacagg gtccctgtcg ctgatgtgag cttggagact 300

cagcccccag gaggacaggt gatggaggga gacaggctgg tcctcatctg ctcagttgct 360

atgggcacag gagacatcac cttcctttgg tacaaagggg ctgtaggttt aaaccttcag 420

tcaaagaccc agcgttcact gacagcagag tatgagattc cttcagtgag ggagagtgat 480

gctgagcaat attactgtgt agctgaaaat ggctatggtc ccagccccag tgggctggtg 540

agcatcactg tcagaatccc ggtgtctcgc ccaatcctca tgctcagggc tcccagggcc 600

caggctgcag tggaggatgt gctggagctt cactgtgagg ccctgagagg ctctcctcca 660

atcctgtact ggttttatca cgaggatatc accctgggga gcaggtcggc cccctctgga 720

ggaggagcct ccttcaacct ttccctgact gaagaacatt ctggaaacta ctcctgtgag 780

gccaacaatg gcctgggggc ccagcgcagt gaggcggtga cactcaactt cacagtgcct 840

actggggcca gaagcaatca tcttacctca gga 873

<210> 14
<211> 1137
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 14

acccttgtgg cgccctcttc tgtcttcgaa ggagacagca tcgttctgaa atgccaggga 60

gaacagaact ggaaaattca gaagatggct taccataagg ataacaaaga gttatctgtt 120

ttcaaaaaat tctcagattt ccttatccaa agtgcagttt taagtgacag tggtaactat 180

ttctgtagta ccaaaggaca actctttctc tgggataaaa cttcaaatat agtaaagata 240

aaagtccaag agctctttca acgtcctgtg ctgactgcca gctccttcca gcccatcgaa 300

gggggtccag tgagcctgaa atgtgagacc cggctctctc cacagaggtt ggatgttcaa 360

ctccagttct gcttcttcag agaaaaccag gtcctggggt caggctggag cagctctccg 420

gagctccaga tttctgccgt gtggagtgaa gacacagggt cttactggtg caaggcagaa 480

acggtgactc acaggatcag aaaacagagc ctccaatccc agattcacgt gcagagaatc 540

cccatctcta atgtaagctt ggagatccgg gcccccgggg gacaggtgac tgaaggacaa 600

aaactgatcc tgctctgctc agtggctggg ggtacaggaa atgtcacatt ctcctggtac 660

agagaggcca caggaaccag tatgggaaag aaaacccagc gttccctgtc agcagagctg 720

gagatcccag ctgtgaaaga gagtgatgcc ggcaaatatt actgtagagc tgacaacggc 780

catgtgccta tccagagcaa ggtggtgaat atccctgtga gaattccagt gtctcgccct 840

gtcctcaccc tcaggtctcc tggggcccag gctgcagtgg gggacctgct ggagcttcac 900

tgtgaggccc tgagaggctc tcccccaatc ttgtaccaat tttatcatga ggatgtcacc 960

cttgggaaca gctcggcccc ctctggagga ggggcctcct tcaacctctc tttgactgca 1020

gaacattctg gaaactactc ctgtgaggcc aacaacggcc tgggggccca gtgcagtgag 1080

gcagtgccag tctccatctc aggacctgat ggctatagaa gagacctcat gacagct 1137

<210> 15
<211> 1659
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 15

gtggccccaa aagctgtact tctcctcaat cctccatggt ccacagcctt caaaggagaa 60

aaagtggctc tcatatgcag cagcatatca cattccctag cccagggaga cacatattgg 120

tatcacgatg agaagttgtt gaaaataaaa catgacaaga tccaaattac agagcctgga 180

aattaccaat gtaagacccg aggatcctcc ctcagtgatg ccgtgcatgt ggaattttca 240

cccgactggc tgatcctgca ggctttacat cctgtctttg aaggagacaa tgtcattctg 300

agatgtcagg ggaaagacaa caaaaacact catcaaaagg tttactacaa ggatggaaaa 360 

cagcttccta atagttataa tttagagaag atcacagtga attcagtctc cagggataat 420 

agcaaatatc attgtactgc ttataggaag ttttacatac ttgacattga agtaacttca 480 

aaacccctaa atatccaagt tcaagagctg tttctacatc ctgtgctgag agccagctct 540 

tccacgccca tagaggggag tcccatgacc ctgacctgtg agacccagct ctctccacag 600 

aggccagatg tccagctgca attctccctc ttcagagata gccagaccct cggattgggc 660 

tggagcaggt cccccagact ccagatccct gccatgtgga ctgaagactc agggtcttac 720 

tggtgtgagg tggagacagt gactcacagc atcaaaaaaa ggagcctgag atctcagata 780 

cgtgtacaga gagtccctgt gtctaatgtg aatctagaga tccggcccac cggagggcag 840

ctgattgaag gagaaaatat ggtccttatt tgctcagtag cccagggttc agggactgtc 900 

acattctcct ggcacaaaga aggaagagta agaagcctgg gtagaaagac ccagcgttcc 960 

ctgttggcag agctgcatgt tctcaccgtg aaggagagtg atgcagggag atactactgt 1020 

gcagctgata acgttcacag ccccatcctc agcacgtgga ttcgagtcac cgtgagaatt 1080 

ccggtatctc accctgtcct caccttcagg gctcccaggg cccacactgt ggtgggggac 1140 

ctgctggagc ttcactgtga gtccctgaga ggctctcccc cgatcctgta ccgattttat 1200 

catgaggacg tcaccctggg gaacagctca gccccctctg gaggaggagc ctccttcaac 1260 

ctctctctga ctgcagaaca ttctggaaac tactcctgtg atgcagacaa tggcctgggg 1320 

gcccagcaca gtcatggagt gagtctcagg gtcacagttc cggtgtctcg ccccgtcctc 1380 

accctcaggg ctcccggggc ccaggctgtg gtgggggacc tgctggagct tcactgtgag 1440

tccctgagag gctccttccc gatcctgtac tggttttatc acgaggatga caccttgggg 1500 

aacatctcgg cccactctgg aggaggggca tccttcaacc tctctctgac tacagaacat 1560 

tctggaaact actcatgtga ggctgacaat ggcctggggg cccagcacag taaagtggtg 1620 

acactcaatg ttacaggaac ttccaggaac agaacaggc 1659 

<210> 16 
<211> 423 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 16 

cattacgcca gggcccgaag gaaaccagga ggactttctg ccactggaac atctagtcac 60 

agtcctagcg agtgtcagga gccttcctcg tccaggcctt ccaggataga ccctcaagag 120

cccactcact ctaaaccact agccccaatg gagctggagc caatgtacag caatgcaaat 180 

cctggagata gcaacccgat ttattcccag atctggagca tccagcatac aaaagaaaac 240 

tcagctaatt gtccaatgat gcatcaagag catgaggaac ttacagtcct ctattcagaa 300 

ctgaagaaga cacacccaga cgactctgca ggggaggcta gcagcagagg cagggcccat 360 

gaagaagatg atgaagaaaa ctatgagaat gtaccacgtg tattactggc ctcagaccac 420 

tag 423 

<210> 17 
<211> 2151 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 17 

gtggccccaa aagctgtact tctcctcaat cctccatggt ccacagcctt caaaggagaa 60 

aaagtggctc tcatatgcag cagcatatca cattccctag cccagggaga cacatattgg 120 

tatcacgatg agaagttgtt gaaaataaaa catgacaaga tccaaattac agagcctgga 180 

aattaccaat gtaagacccg aggatcctcc ctcagtgatg ccgtgcatgt ggaattttca 240 

cccgactggc tgatcctgca ggctttacat cctgtctttg aaggagacaa tgtcattctg 300 

agatgtcagg ggaaagacaa caaaaacact catcaaaagg tttactacaa ggatggaaaa 360 

cagcttccta atagttataa tttagagaag atcacagtga attcagtctc cagggataat 420 

agcaaatatc attgtactgc ttataggaag ttttacatac ttgacattga agtaacttca 480

aaacccctaa atatccaagt tcaagagctg tttctacatc ctgtgctgag agccagctct 540

tccacgccca tagaggggag tcccatgacc ctgacctgtg agacccagct ctctccacag 600

aggccagatg tccagctgca attctccctc ttcagagata gccagaccct cggattgggc 660

tggagcaggt cccccagact ccagatccct gccatgtgga ctgaagactc agggtcttac 720

tggtgtgagg tggagacagt gactcacagc atcaaaaaaa ggagcctgag atctcagata 780

cgtgtacaga gagtccctgt gtctaatgtg aatctagaga tccggcccac cggagggcag 840

ctgattgaag gagaaaatat ggtccttatt tgctcagtag cccagggttc agggactgtc 900

acattctcct ggcacaaaga aggaagagta agaagcctgg gtagaaagac ccagcgttcc 960

ctgttggcag agctgcatgt tctcaccgtg aaggagagtg atgcagggag atactactgt 1020

gcagctgata acgttcacag ccccatcctc agcacgtgga ttcgagtcac cgtgagaatt 1080

ccggtatctc accctgtcct caccttcagg gctcccaggg cccacactgt ggtgggggac 1140

ctgctggagc ttcactgtga gtccctgaga ggctctcccc cgatcctgta ccgattttat 1200

catgaggacg tcaccctggg gaacagctca gccccctctg gaggaggagc ctccttcaac 1260

ctctctctga ctgcagaaca ttctggaaac tactcctgtg atgcagacaa tggcctgggg 1320

gcccagcaca gtcatggagt gagtctcagg gtcacagttc cggtgtctcg ccccgtcctc 1380

accctcaggg ctcccggggc ccaggctgtg gtgggggacc tgctggagct tcactgtgag 1440

tccctgagag gctccttccc gatcctgtac tggttttatc acgaggatga caccttgggg 1500

aacatctcgg cccactctgg aggaggggca tccttcaacc tctctctgac tacagaacat 1560

tctggaaact actcatgtga ggctgacaat ggcctggggg cccagcacag taaagtggtg 1620

acactcaatg ttacaggaac ttccaggaac agaacaggcc ttaccgctgc gggaatcacg 1680

gggctggtgc tcagcatcct cgtccttgct gctgctgctg ctctgctgca ttacgccagg 1740

gcccgaagga aaccaggagg actttctgcc actggaacat ctagtcacag tcctagcgag 1800

tgtcaggagc cttcctcgtc caggccttcc aggatagacc ctcaagagcc cactcactct 1860

aaaccactag ccccaatgga gctggagcca atgtacagca atgcaaatcc tggagatagc 1920

aacccgattt attcccagat ctggagcatc cagcatacaa aagaaaactc agctaattgt 1980

ccaatgatgc atcaagagca tgaggaactt acagtcctct attcagaact gaagaagaca 2040

cacccagacg actctgcagg ggaggctagc agcagaggca gggcccatga agaagatgat 2100

gaagaaaact atgagaatgt accacgtgta ttactggcct cagaccacta g 2151

<210> 18
<211> 315
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 18

agatcctgga gaaaagctgg gccccttcca tcccagatac cacccacagc tccaggtgga 60

gagcagtgcc cactatatgc caacgtgcat caccagaaag ggaaagatga aggtgttgtc 120

tactctgtgg tgcatagaac ctcaaagagg agtgaagcca ggtctgctga gttcaccgtg 180

gggagaaagg acagttctat catctgtgcg gaggtgagat gcctgcagcc cagtgaggtt 240

tcatccacgg aggtgaatat gagaagcagg actctccaag aaccccttag cgactgtgag 300

gaggttctct gctag 315

<210> 19
<211> 870
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 19

actgtctggc tgtacctcca agcctggcca aaccctgtgt ttgaaggaga tgccctgact 60

ctgcgatgtc agggatggaa gaatacacca ctgtctcagg tgaagttcta cagagatgga 120

aaattccttc atttctctaa ggaaaaccag actctgtcca tgggagcagc aacagtgcag 180

agccgtggcc agtacagctg ctctgggcag gtgatgtata ttccacagac attcacacaa 240

acttcagaga ctgccatggt tcaagtccaa gagctgtttc cacctcctgt gctgagtgcc 300

atcccctctc ctgagccccg agagggtagc ctggtgaccc tgagatgtca gacaaagctg 360 

caccccctga ggtcagcctt gaggctcctt ttctccttcc acaaggacgg ccacaccttg 420 

caggacaggg gccctcaccc agaactctgc atcccgggag ccaaggaggg agactctggg 480 

ctttactggt gtgaggtggc ccctgagggt ggccaggtcc agaagcagag cccccagctg 540

gaggtcagag tgcaggctcc tgtatcccgt cctgtgctca ctctgcacca cgggcctgct 600 

gaccctgctg tgggggacat ggtgcagctc ctctgtgagg cacagagggg ctcccctccg 660 

atcctgtatt ccttctacct tgatgagaag attgtgggga accactcagc tccctgtggt 720 

ggaaccacct ccctcctctt cccagtgaag tcagaacagg atgctgggaa ctactcctgc 780

gaggctgaga acagtgtctc cagagagagg agtgagccca agaagctgtc tctgaagggt 840

tctcaagtct tgttcactcc cgccagcaac 870 



<210> 20 
<211> 1257 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 20 

actgtctggc tgtacctcca agcctggcca aaccctgtgt ttgaaggaga tgccctgact 60 

ctgcgatgtc agggatggaa gaatacacca ctgtctcagg tgaagttcta cagagatgga 120 

aaattccttc atttctctaa ggaaaaccag actctgtcca tgggagcagc aacagtgcag 180 

agccgtggcc agtacagctg ctctgggcag gtgatgtata ttccacagac attcacacaa 240 

acttcagaga ctgccatggt tcaagtccaa gagctgtttc cacctcctgt gctgagtgcc 300 

atcccctctc ctgagccccg agagggtagc ctggtgaccc tgagatgtca gacaaagctg 360

caccccctga ggtcagcctt gaggctcctt ttctccttcc acaaggacgg ccacaccttg 420

caggacaggg gccctcaccc agaactctgc atcccgggag ccaaggaggg agactctggg 480

ctttactggt gtgaggtggc ccctgagggt ggccaggtcc agaagcagag cccccagctg 540 

gaggtcagag tgcaggctcc tgtatcccgt cctgtgctca ctctgcacca cgggcctgct 600 

gaccctgctg tgggggacat ggtgcagctc ctctgtgagg cacagagggg ctcccctccg 660 

atcctgtatt ccttctacct tgatgagaag attgtgggga accactcagc tccctgtggt 720 

ggaaccacct ccctcctctt cccagtgaag tcagaacagg atgctgggaa ctactcctgc 780 

gaggctgaga acagtgtctc cagagagagg agtgagccca agaagctgtc tctgaagggt 840 

tctcaagtct tgttcactcc cgccagcaac tggctggttc cttggcttcc tgcgagcctg 900 

cttggcctga tggttattgc tgctgcactt ctggtttatg tgagatcctg gagaaaagct 960 

gggccccttc catcccagat accacccaca gctccaggtg gagagcagtg cccactatat 1020 

gccaacgtgc atcaccagaa agggaaagat gaaggtgttg tctactctgt ggtgcataga 1080 

acctcaaaga ggagtgaagc caggtctgct gagttcaccg tggggagaaa ggacagttct 1140

atcatctgtg cggaggtgag atgcctgcag cccagtgagg tttcatccac ggaggtgaat 1200 

atgagaagca ggactctcca agaacccctt agcgactgtg aggaggttct ctgctag 1257 



<210> 21 
<211> 292 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 21 

Ala Glu Leu Phe Leu Ile Ala Ser Pro Ser His Pro Thr Glu Gly Ser
1 5 10 15

Pro Val Thr Leu Thr Cys Lys Met Pro Phe Leu Gln Ser Ser Asp Ala
20 25 30

Gln Phe Gln Phe Cys Phe Phe Arg Asp Thr Arg Ala Leu Gly Pro Gly
35 40 45

Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp
50 55 60

Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu
65 70 75 80

Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp
85 90 95

Val Ser Leu Glu Thr Gln Pro Pro Gly Gly Gln Val Met Glu Gly Asp
100 105 110

Arg Leu Val Leu Ile Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr
115 120 125

Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr
130 135 140

Gln Arg Ser Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser
145 150 155 160

Asp Ala Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser
165 170 175

Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro
180 185 190

Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val
195 200 205

Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr
210 215 220

Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser
225 230 235 240

Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly
245 250 255

Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu
260 265 270

Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His
275 280 285

Leu Thr Ser Gly
290

<210> 22 
<211> 380 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 22

Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val
1 5 10 15

Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Ile Gln Lys Met Ala Tyr
20 25 30

His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe
35 40 45

Leu Ile Gln Ser Ala Val Leu Ser Asp Ser Gly Asn Tyr Phe Cys Ser
50 55 60

Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn Ile Val Lys
65 70 75 80

Ile Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser
85 90 95

Phe Gln Pro Ile Glu Gly Gly Pro Val Ser Leu Lys Cys Glu Thr Arg
100 105 110

Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg
115 120 125

Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln
130 135 140

Ile Ser Ala Val Trp Ser Glu Asp Thr Gly Ser Tyr Trp Cys Lys Ala
145 150 155 160

Glu Thr Val Thr His Arg Ile Arg Lys Gln Ser Leu Gln Ser Gln Ile
165 170 175

His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala
180 185 190

Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser
195 200 205

Val Ala Gly Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala
210 215 220

Thr Gly Thr Ser Met Gly Lys Lys Thr Gln Arg Ser Leu Ser Ala Glu
225 230 235 240

Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys
245 250 255

Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile
260 265 270

Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro
275 280 285

Gly Ala Gln Ala Ala Val Gly Asp Leu Leu Glu Leu His Cys Glu Ala
290 295 300

Leu Arg Gly Ser Pro Pro Ile Leu Tyr Gln Phe Tyr His Glu Asp Val
305 310 315 320

Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn
325 330 335

Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn
340 345 350

Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser
355 360 365

Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala
370 375 380

<210> 23
<211> 140
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 23

His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly
1 5 10 15

Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg
20 25 30

Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala
35 40 45

Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Ala Asn Pro Gly Asp Ser
50 55 60

Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn
65 70 75 80

Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val
85 90 95

Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu
100 105 110

Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr
115 120 125

Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His
130 135 140

<210> 24
<211> 554
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 24

Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr
1 5 10 15

Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His
20 25 30

Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu
35 40 45

Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln
50 55 60

Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe
65 70 75 80

Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly
85 90 95

Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His
100 105 110

Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn
115 120 125

Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr
130 135 140

His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr
145 150 155 160

Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val
165 170 175

Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu
180 185 190

Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln
195 200 205

Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg
210 215 220

Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser Gly Ser
225 230 235 240

Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Ile Lys Lys Arg Ser
245 250 255

Leu Arg Ser Gln Ile Arg Val Gln Arg Val Pro Val Ser Asn Val Asn
260 265 270

Leu Glu Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met
275 280 285

Val Leu Ile Cys Ser Val Ala Gln Gly Ser Gly Thr Val Thr Phe Ser
290 295 300

Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr Gln Arg
305 310 315 320

Ser Leu Leu Ala Glu Leu His Val Leu Thr Val Lys Glu Ser Asp Ala
325 330 335

Gly Arg Tyr Tyr Cys Ala Ala Asp Asn Val His Ser Pro Ile Leu Ser
340 345 350

Thr Trp Ile Arg Val Thr Val Arg Ile Pro Val Ser His Pro Val Leu
355 360 365

Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu
370 375 380

Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe
385 390 395 400

Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly
405 410 415

Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr
420 425 430

Ser Cys Asp Ala Asp Asn Gly Leu Gly Ala Gln His Ser His Gly Val
435 440 445

Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg
450 455 460

Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys
465 470 475 480

Glu Ser Leu Arg Gly Ser Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu
485 490 495

Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser
500 505 510

Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu
515 520 525

Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn
530 535 540

Val Thr Gly Thr Ser Arg Asn Arg Thr Gly
545 550

<210> 25
<211> 717
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 25

Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr
1 5 10 15

Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His
20 25 30

Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu
35 40 45

Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln
50 55 60

Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe
65 70 75 80

Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly
85 90 95

Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His
100 105 110

Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn
115 120 125

Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr
130 135 140

His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr
145 150 155 160

Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val
165 170 175

Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu
180 185 190

Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln
195 200 205

Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg
210 215 220

Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser Gly Ser
225 230 235 240

Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Ile Lys Lys Arg Ser
245 250 255

Leu Arg Ser Gln 

260 

Leu Glu He Arg 
275 

Val Leu He Cys 
290 

Trp His Lys Glu 
305 

Ser Leu Leu Ala 

Gly Arg Tyr Tyr 

340 

Thr Trp He Arg 
355 

Thr Phe Arg Ala 
370 

Leu His Cys Glu 
385 

Tyr His Glu Asp 

Gly Ala Ser Phe 

420 

Ser Cys Asp Ala 
435 

Ser Leu Arg Val 
450 

Ala Pro Gly Ala 
465 

Glu Ser Leu Arg 



He Arg 

Pro Thr 

Ser Val 

Gly Arg 
310 
Glu Leu 
325 

Cys Ala 
Val Thr 
Pro Arg 



Val Gin Arg 
265 

Gly Gly Gin 

280 
Ala Gin Gly 
295 

Val Arg Ser 

His Val Leu 

Ala Asp Asn 
345 

Val Arg He 

360 
Ala His Thr 
375 

Arg Gly Ser 



Ser Leu 
390 

Val Thr Leu Gly Asn 
405 

Asn Leu 



Asp Asp Thr Leu 

500 

Phe Asn Leu Ser 
515 

Ala Asp Asn Gly 
530 

Val Thr Gly Thr 
545 

Thr Gly Leu Val 

Leu His Tyr Ala 

580 

Gly Thr Ser Ser 
595 

Arg Pro Ser Arg 
610 

Ala Pro Met Glu 
625 

Ser Asn Pro He 

Asn Ser Ala Asn 

660 

Val Leu Tyr Ser 
675 

Glu Ala Ser Ser 
690 

Tyr Glu Asn Val 
705 



Asp Asn 

Thr Val 

Gin Ala 
470 
Gly Ser 
485 

Gly Asn 



Leu Thr 

Leu Gly 

Ser Arg 
550 
Leu Ser 
565 

Arg Ala 
His Ser 
He Asp 



Leu Glu 
630 
Tyr Ser 
645 

Cys Pro 

Glu Leu 

Arg Gly 

Pro Arg 
710 



Ser Leu Thr 
425 

Gly Leu Gly 

440 
Pro Val Ser 
455 

Val Val Gly 

Phe Pro He 

lie Ser Ala 
505 

Thr Glu His 

520 
Ala Gin His 
535 

Asn Arg Thr 

He Leu Val 

Arg Arg Lys 
585 

Pro Ser Glu 

600 
Pro Gin Glu 
615 

Pro Met Tyr 



Val Pro Val 

Leu He Glu 

Ser Gly Thr 
300 

Leu Gly Arg 

315 
Thr Val Lys 
330 

Val His Ser 

Pro Val Ser 

Val Val Gly 
380 

Pro Pro He 

395 
Ser Ser Ala 
410 

Ala Glu His 



Ser Asn Val Asn 

270 • 
Gly Glu Asn Met 
285 

Val Thr Phe Ser 



Ala Gin His 



Arg Pro Val 
460 

Asp Leu Leu 

475 
Leu Tyr Trp 
490 

His Ser Gly 



Lys Thr Gin 

Glu Ser Asp 
335 

Pro He Leu 

350 
His Pro Val 
365 

Asp Leu Leu 

Leu Tyr Arg 

Pro Ser Gly 
415 

Ser Gly Asn 

430 
Ser His Gly 
445 

Leu Thr Leu 



Arg 
320 
Ala 

Ser 

Leu 

Glu 

Phe 
400 
Gly 

Tyr 

Val 

Arg 



Glu Leu . His Cys 

480 

Phe Tyr His Glu 
495 

Gly Gly Ala Ser 
510 



Ser Gly Asn Tyr Ser Cys Glu 

525 

Val Thr Leu Asn 



Gin He Trp 

Met Met His 
665 

Lys Lys Thr 

680 
Arg Ala His 
695 

Val Leu Leu 



Ser Lys Val 
540 

Gly Leu Thr 

555 
Leu Ala Ala 
570 

Pro Gly Gly 

Cys Gin Glu 

Pro Thr His 
620 

Ser Asn Ala 

635 
Ser He Gin 
650 

Gin Glu His 



Ala Ala Gly He 

560 

Ala Ala Ala Leu 
575 

Leu Ser Ala Thr 
590 

Pro Ser Ser Ser 
605 

Ser Lys Pro Leu 

Asn Pro Gly Asp 

640 

His Thr Lys Glu 
655 

Glu Glu Leu Thr 
670 

His Pro Asp Asp Ser Ala Gly 

685 

Asp Glu Glu Asn 



Glu Glu Asp 
700 

Ala Ser Asp 
715 



His 



<210> 26 
<211> 104 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 26 


















Pro 


Arg 


Ser Trp 


Aytt 
ni y 


Lvs Ala 


Glv 

VJ J. Jf 


Pro 


Leu Pro 


Ser 


Gin 


lie 


1 






5 






10 






Val 


His 


Ala 


Pro Gly 


uiy 






Pro 


Leu Tvr 


Ala 


Asn 




20 








25 








30 


Lys 


Gly Lys 


Asp 


Glu Gly 


Val 


Val 


Tyr Ser 


Val 


Val 


His 


Arg 


35 








40 








45 




Lys 


Arg Ser 


Glu 


Ala Arg 


Ser 


Ala 


Glu Phe 


Thr 


Val 


Gly Arg 


50 






55' 








60 






Ser 


Ser lie 


lie 


Cys Ala 


Glu 


Val 


Arg Cys 


Leu 


Gin 


Pro 


Ser 


65 






70 








75 






Glu 


Ser 


Ser Thr 


Glu 


Val Asn 
85 


Met 


Arg 


Ser Arg 
90 


Thr 


Leu 


Gin 


Ser 


Asp Cys 


Glu 
100 


Glu Val 


Leu 


Cys 













15 



80 

Pro Le 1 
95 



<210> 27 
<211> 291 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 27 

Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu 
1 5 10 15 

Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu 
20 25 30 

Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys 
35 40 45 

Glu Asn Gln Thr Leu Ser Met Gly Ala Ala Thr Val Gln Ser Arg Gly 
50 55 60 

Gln Tyr Ser Cys Ser Gly Gln Val Met Tyr Ile Pro Gln Thr Phe Thr 
65 70 75 80 

Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro 
85 90 95 

Pro Val Leu Ser Ala Ile Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu 
100 105 110 

Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu 
115 120 125 

Arg Leu Leu Phe Ser Phe His Lys Asp Gly His Thr Leu Gln Asp Arg 
130 135 140 

Gly Pro His Pro Glu Leu Cys Ile Pro Gly Ala Lys Glu Gly Asp Ser 
145 150 155 160 

Gly Leu Tyr Trp Cys Glu Val Ala Pro Glu Gly Gly Gln Val Gln Lys 
165 170 175 

Gln Ser Pro Gln Leu Glu Val Arg Val Gln Ala Pro Val Ser Arg Pro 
180 185 190 

Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val Gly Asp Met 
195 200 205 

Val Gln Leu Leu Cys Glu Ala Gln Arg Gly Ser Pro Pro Ile Leu Tyr 
210 215 220 

Ser Phe Tyr Leu Asp Glu Lys Ile Val Gly Asn His Ser Ala Pro Cys 
225 230 235 240 

Gly Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala 
245 250 255 

Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser 
260 265 270 

Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro 
275 280 285 

Ala Ser Asn 
290 

<210> 28 
<211> 419 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 28 

Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu 
1 5 10 15 

Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu 
20 25 30 

Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys 
35 40 45 

Glu Asn Gln Thr Leu Ser Met Gly Ala Ala Thr Val Gln Ser Arg Gly 
50 55 60 

Gln Tyr Ser Cys Ser Gly Gln Val Met Tyr Ile Pro Gln Thr Phe Thr 
65 70 75 80 

Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro 
85 90 95 

Pro Val Leu Ser Ala Ile Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu 
100 105 110 

Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu 
115 120 125 

Arg Leu Leu Phe Ser Phe His Lys Asp Gly His Thr Leu Gln Asp Arg 
130 135 140 

Gly Pro His Pro Glu Leu Cys Ile Pro Gly Ala Lys Glu Gly Asp Ser 
145 150 155 160 

Gly Leu Tyr Trp Cys Glu Val Ala Pro Glu Gly Gly Gln Val Gln Lys 
165 170 175 

Gln Ser Pro Gln Leu Glu Val Arg Val Gln Ala Pro Val Ser Arg Pro 
180 185 190 

Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val Gly Asp Met 
195 200 205 

Val Gln Leu Leu Cys Glu Ala Gln Arg Gly Ser Pro Pro Ile Leu Tyr 
210 215 220 

Ser Phe Tyr Leu Asp Glu Lys Ile Val Gly Asn His Ser Ala Pro Cys 
225 230 235 240 

Gly Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala 
245 250 255 

Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser 
260 265 270 

Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro 
275 280 285 

Ala Ser Asn Trp Leu Val Pro Trp Leu Pro Ala Ser Leu Leu Gly Leu 
290 295 300 

Met Val Ile Ala Ala Ala Leu Leu Val Tyr Val Arg Ser Trp Arg Lys 
305 310 315 320 

Ala Gly Pro Leu Pro Ser Gln Ile Pro Pro Thr Ala Pro Gly Gly Glu 
325 330 335 

Gln Cys Pro Leu Tyr Ala Asn Val His His Gln Lys Gly Lys Asp Glu 
340 345 350 

Gly Val Val Tyr Ser Val Val His Arg Thr Ser Lys Arg Ser Glu Ala 
355 360 365 

Arg Ser Ala Glu Phe Thr Val Gly Arg Lys Asp Ser Ser Ile Ile Cys 
370 375 380 

Ala Glu Val Arg Cys Leu Gln Pro Ser Glu Val Ser Ser Thr Glu Val 
385 390 395 400 

Asn Met Arg Ser Arg Thr Leu Gln Glu Pro Leu Ser Asp Cys Glu Glu 
405 410 415 

Val Leu Cys 



<210> 29 
<211> 16 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 29 

Met Leu Pro Arg Leu Leu Leu Leu Ile Cys Ala Pro Leu Cys Glu Pro 
1 5 10 15 



<210> 30 
<211> 19 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 30 

Met Leu Leu Trp Ser Leu Leu Val Ile Phe Asp Ala Val Thr Glu Gln 

1 5 10 15 

Ala Asp Ser 



<210> 31 

<211> 17 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 31 

Met Leu Leu Trp Leu Leu Leu Leu Ile Leu Thr Pro Gly Arg Glu Gln 

1 5 10 15 

Ser 



<210> 32 
<211> 15 
<212> PRT 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 32 

Met Leu Leu Trp Thr Ala Val Leu Leu Phe Val Pro Cys Val Gly 
1 5 10 15 

<210> 33 
<211> 51 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 33 

atgctgccga ggctgttgct gttgatctgt gctccactct gtgaacctgc 51 

<210> 34 
<211> 1236 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 34 

aaactgtttt tgatagccag cccctcccat cccacagagg ggagcccagt gaccctgacg 60 
?g?aagaSc cctttctaca gagttcagat gcccagttcc agttctgctt tttcagagac 120 
Scgggcct tgggcccagg ctggagcagc tcccccaagc tccagatcgc tgccatgtgg 180 
aaagSgaca cagggtcata ctggtgcgag gcacagacaa tggcgtccaa jgtcttgagg 240 
aacaagaqat cccagataaa tgtgcacagg gtccctgtcg ctgatgtgag cttggagact 300 
Sgc g cccSg gaggacaggt gatggaggga gacaggctgg tcctcatctg ctcagttgct 360 
atgggcacag gagacatcac cttcctttgg tacaaagggg ctgtaggttt aaac cttcag 420 
Kaaaaaccc agcgttcact gacagcagag tatgagattc cttcagtgag ggagagtgat 480 
actaaacaat Stactgtgt agctgaaaat ggctatggtc ccagccccag tgggctggtg 540 
ScSSSS tcagaaLcc ggtgLtcgc ccaatcctca tgctcagggc tcccagggcc 600 
caggctgcag tggaggatgt gctggagctt cactgtgagg ccctgagagg ctctcctcca 660 
atccta?act ggttttatca cgaggatatc accctgggga gcaggtcggc cccctctgga 720 
ggaggagcc^ ccttcaacct t?ccctgact gaagaacatt ctggaaacta ctcctgtgag 780 
^caacaatg gcctgggggc ccagcgcagt gaggcggtga cactcaactt cacagtgcct 840 
actcraagcca gaagcaatca tcttacctca ggagtcattg aggggctgct cagcaccctt 900 
iatccScca ccg^ggcctt attattttgc tacggcctca aaagaaaaat aggaagacgt 960 
?caaccaaag atcSctcag gagccttccc agccctctac cccaagagtt cacctacctc 1020 
«S«22 ccccagggca gcUagcct atatatgaaa atgtgaatgt ^jgtggg 1080 
aataaqottt attcactggc gtactataac cagccggagc aggaatcagt agcagcagaa 1140 
accSgggga cacatatgga ggacaaggtt tccttagaca tctattccag gctgaggaaa 1200 
gcaaacatta cagatgtgga ctatgaagat gctatg 1236 

<210> 35 
<211> 60 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 35 

atgctgctgt ggtcattgct ggtcatcttt gatgcagtca ctgaacaggc agattcgctg 60 

<210> 36 
<211> 1464 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 36 

acccttgtgg cgccctcttc tgtcttcgaa ggagacagca tcgttctgaa atgccaggga 60 
gaacagaact ggaaaattca gaagatggct taccataagg ataacaaaga gttatctgtt 120 
ttcaaaaaat tctcagattt ccttatccaa agtgcagttt taagtgacag tggtaactat 180 
ttctgtagta ccaaaggaca actctttctc tgggataaaa cttcaaatat agtaaagata 240 
aaagtccaag agctctttca acgtcctgtg ctgactgcca gctccttcca gcccatcgaa 300 
gggggtccag tgagcctgaa atgtgagacc cggctctctc cacagaggtt ggatgttcaa 360 
ctccagttct gcttcttcag agaaaaccag gtcctggggt caggctggag cagctctccg 420 
gagctccaga tttctgccgt gtggagtgaa gacacagggt cttactggtg caaggcagaa 480 
acggtgactc acaggatcag aaaacagagc ctccaatccc agattcacgt gcagagaatc 540 
cccatctcta atgtaagctt ggagatccgg gcccccgggg gacaggtgac tgaaggacaa 600 
aaactgatcc tgctctgctc agtggctggg ggtacaggaa atgtcacatt ctcctggtac 660 
agagaggcca caggaaccag tatgggaaag aaaacccagc gttccctgtc agcagagctg 720 
gagatcccag ctgtgaaaga gagtgatgcc ggcaaatatt actgtagagc tgacaacggc 780 
catgtgccta tccagagcaa ggtggtgaat atccctgtga gaattccagt gtctcgccct 840 
gtcctcaccc tcaggtctcc tggggcccag gctgcagtgg gggacctgct ggagcttcac 900 
tgtgaggccc tgagaggctc tcccccaatc ttgtaccaat tttatcatga ggatgtcacc 960 
cttgggaaca gctcggcccc ctctggagga ggggcctcct tcaacctctc tttgactgca 1020 
gaacattctg gaaactactc ctgtgaggcc aacaacggcc tgggggccca gtgcagtgag 1080 
gcagtgccag tctccatctc aggacctgat ggctatagaa gagacctcat gacagctgga 1140 
gttctctggg gactgtttgg tgtccttggt ttcactggtg ttgctttgct gttgtatgcc 1200 
ttgttccaca agatatcagg agaaagttct gccactaatg aacccagagg ggcttccagg 1260 
ccaaatcctc aagagttcac ctattcaagc ccaaccccag acatggagga gctgcagcca 1320 
gtgtatgtca atgtgggctc tgtagatgtg gatgtggttt attctcaggt ctggagcatg 1380 
cagcagccag aaagctcagc aaacatcagg acacttctgg agaacaagga ctcccaagtc 1440 
atctactctt ctgtgaagaa atca 1464 

<210> 37 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 37 

atgcttctgt ggctgctgct gctgatcctg actcctggaa gagaacaatc aggg 54 

<210> 38 
<211> 2148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 38 

ataacccLa aagctgtact tctcctcaat cctccatggt ccacagcctt caaaggagaa 60 
?2ta!gcag cagcatatca cattccctag cccagggaga caca^ttgg 120 
tatcacgatg agaagttgtt gaaaataaaa catgacaaga tccaaattac agagcctgga 180 
aattaccaat gtaagacccg aggatcctcc ctcagtgatg ccgtgcatgt OT"«ttca 240 
cccgactggc ?gatcctgca ggctttacat cctgtctttg «ggagacaa tgtcattctg 300 
aaatatcagg ggaaagacaa caaaaacact catcaaaagg tttactacaa ggatggaaaa 360 
caac?tcc?a Stagt?ataa tttagagaag atcacagtga attcagtctc cagggataat 420 
agcaaatatc attgtactgc ttataggaag ttttacatac ttgacattga agtaacttca 480 
aaacccctaa atatccaagt tcaagagctg tttctacatc ctgtgctgag agccagctct 540 
tccacgccca tagaggggag tcccatgacc ctgacctgtg agacccagct ctctccacag 600 
aaaccagatg tccagctgca attctccctc ttcagagata gccagaccct cggattgggc 660 
?ggagcagg? cccccagact ccagatccct gccatgtgga ctgaagactc agggtcttac 720 
taaStgagg tggagacagt gactcacagc atcaaaaaaa ggagcctgag atctcagata 780 
cg?g?acaga gagtccctgt gtctaatgtg aatctagaga tccggcccac cggagggcag 840 
c?gattgaag gagaaaatat ggtccttatt tgctcagtag cccagggttc agggactgtc 900 
acStctcc? ggcacaaaga aggaagagta agaagcctgg gtagaaagac ccagcgttcc 960 
ctgttggcag agctgcatgt tctcaccgtg aaggagagtg atgcagggag atactactgt 1020 
gcagc?gata acgtLacag ccccatcctc agcacgtgga ttcgagtcac cgtgagaatt 1080 
ccggtatctc accctgtcct caccttcagg gctcccaggg cccacactgt TOtgggggac 1140 
ctgctggagc ttcactgtga gtccctgaga ggctctcccc cgatcctgta ccgattttat 1200 
ca^gagglcg tcaccc?ggg gaacagctca gccccctctg gaggaggagc ctccttcaac 1260 
ctctctctga ctgcagaaca ttctggaaac tactcctgtg atgcagacaa tggcctgggg 1320 
gcccagcaca gtcatggagt gagtctcagg gtcacagttc cggtgtctcg ccccgtcctc 1380 
accctcaggg ctcccggggc ccaggctgtg gtgggggacc tgctggagct tcactgtgag 1440 
tccctgagag gctccttccc gatcctgtac tggttttatc acgaggatga caccttgggg 1500 
aacatctcgg cccactctgg aggaggggca tccttcaacc tctctctgac tacagaacat 1560 
tctggaaact actcatgtga ggctgacaat ggcctggggg cccagcacag taaagtggtg 1620 
acactcaatg ttacaggaac ttccaggaac agaacaggcc ttaccgctgc g^^cacg 1680 
gggctggtgc tcagcatcct cgtccttgct gctgctgctg ctctgctgca ttacgccagg 1740 
gcccgaagga aaccaggagg actttctgcc actggaacat ctagtcacag tcctagcgag 1800 
?gtcaggagc cttcctcgtc caggccttcc aggatagacc ctcaagagcc cactcactct 1860 
aaaccactag ccccaatgga gctggagcca atgtacagca atgcaaatcc tggagatagc 1920 
aacccgattt attcccagat ctggagcatc cagcatacaa aagaaaactc agctaattgt 1980 
ccaatgatgc atcaagagca tgaggaactt acagtcctct att ^^ a ^^^!^^ 2040 
cacccagacg actctgcagg ggaggctagc agcagaggca gggcccatga agaagatgat 2100 
gaagaaaact atgagaatgt accacgtgta ttactggcct cagaccac 2148 

<210> 39 
<211> 48 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 39 

atgctgctct ggacggctgt gctgctcttt gttccctgtg ttgggaaa 48 

<210> 40 
<211> 2003 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 40 

aggtctggtt gctctctgcc ggcttcgccc tgacctgttt ctgacctgtg ttccctccgc 60 
tgtgccagaa caggccccat gctgctctgg acggctgtgc tgctctttgt tccctgtgtt 120 
gggaaaactg tctggctgta cctccaagcc tggccaaacc ctgtgtttga aggagatgcc 180 
ctgactctgc gatgtcaggg atggaagaat acaccactgt ctcaggtgaa gttctacaga 240 
gatggaaaat tccttcattt ctctaaggaa aaccagactc tgtccatggg agcagcaaca 300 
gtgcagagcc gtggccagta cagctgctct gggcaggtga tgtatattcc acagacattc 360 
acacaaactt cagagactgc catggttcaa gtccaagagc tgtttccacc tcctgtgctg 420 
agtgccatcc cctctcctga gccccgagag ggtagcctgg tgaccctgag atgtcagaca 480 
aagctgcacc ccctgaggtc agccttgagg ctccttttct ccttccacaa ggacggccac 540 
accttgcagg acaggggccc tcacccagaa ctctgcatcc cgggagccaa ggagggagac 600 
tctgggcttt actggtgtga ggtggcccct gagggtggcc aggtccagaa gcagagcccc 660 
cagctggagg tcagagtgca ggctcctgta tcccgtcctg tgctcactct gcaccacggg 720 
cctgctgacc ctgctgtggg ggacatggtg cagctcctct gtgaggcaca gaggggctcc 780 
cctccgatcc tgtattcctt ctaccttgat gagaagattg tggggaacca ctcagctccc 840 
tgtggtggaa ccacctccct cctcttccca gtgaagtcag aacaggatgc tgggaactac 900 
tcctgcgagg ctgagaacag tgtctccaga gagaggagtg agcccaagaa gctgtctctg 960 
aagggttctc aagtcttgtt cactcccgcc agcaactggc tggttccttg gcttcctgcg 1020 
agcctgcttg gcctgatggt tattgctgct gcacttctgg tttatgtgag atcctggaga 1080 
aaagctgggc cccttccatc ccagatacca cccacagctc caggtggaga gcagtgccca 1140 
ctatatgcca acgtgcatca ccagaaaggg aaagatgaag gtgttgtcta ctctgtggtg 1200 
catagaacct caaagaggag tgaagccagg tctgctgagt tcaccgtggg gagaaaggac 1260 
agttctatca tctgtgcgga ggtgagatgc ctgcagccca gtgaggtttc atccacggag 1320 
gtgaatatga gaagcaggac tctccaagaa ccccttagcg actgtgagga ggttctctgc 1380 
tagtgatggt gttctcctat caacacacgc ccacccccag tctccagtgc tcctcaggaa 1440 
gacagtgggg tcctcaactc tttctgtggg tccttcagtt cccaagccca gcatcacaga 1500 
gccccctgag cccttgtcct ggtcaggagc acctgaaccc tgggttcttt tcttagcaga 1560 
agaccaacca atggaatggg aagggagatg ctcccaccaa cacacacact taggttcaat 1620 
cagtgacact ggacacataa gccacagatg tcttctttcc atacaagcat gttagttcgc 1680 
cccaatatac atatatatat gaaatagtca tgtgccgcat aacaacattt cagtcagtga 1740 
tagactgcat acacaacagt ggtcccataa gactgtaatg gagtttaaaa attcctactg 1800 
cctagtgata tcatagttgc cttaacatca taacacaaca catttctcac gcgtttgtgg 1860 
tgatgctggt acaaacaagc tacagcgccg ctagtcatat acaaatatag cacatacaat 1920 
tatgtacagt acactatact tgataatgat aataaacaac tatgttactg gtctaaaaaa 1980 
aaaaaaaaaa aaaaaaaaaa aaa 2003 

<210> 41 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 41 

tgagtctcag ggtcacagtt ccg 

<210> 42 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 42 

gctcttgaac ttggatattt aggggt 26 

<210> 43 
<211> 25 






WO 03/089624 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 43 

ccagtgtatg tcaatgtggg ctctg 

<210> 44 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 44 

cgttgaaaga gctcttggac ttttatc 

<210> 45 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 45 

gcctcaaaag aaaaatagga agacgtt 

<210> 46 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 46 

aagctcacat cagcgacagg gac 

<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 47 

tcttggagat aagtcgggct tt 

<210> 48 
<211> 25 
<212> DNA 






<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 48 

atcctgcagc ccagcctcgt aggag 

<210> 49 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 49 

ggtcctcatg ctgctgtggt catt 

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 50 

gctgttgatc ttcccttctg attc 

<210> 51 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 51 

atgctgccga ggctgttgct gttg 

<210> 52 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 52 

catagcatct tcatagtcca catc 

<210> 53 
<211> 24 
<212> DNA 

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 53 

ctcaacttca cagtgcctac tggg 

<210> 54 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 54 

tcctgcagag tcactaacct tgag 

<210> 55 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 55 

ccagtgtatg tcaatgtggg ctctg 

<210> 56 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 56 

cattcttccc tcaaatcttt acac 

<210> 57 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 57 

cagcacgtgg attcgagtca c 

<210> 58 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 58 

cagatctggg aataaatcgg gttg 

<210> 59 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 59 

tcttcagaga tggcgaggtc a 

<210> 60 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 60 

ttttggggtg tacatcaaca tacaag 

<210> 61 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 61 

tgttgccctg tttcttccaa taca 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 62 

cagagttggc cgacctacgc 

<210> 63 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence : /note = 
synthetic construct 

<221> VARIANT 
<222> 5, 15, 17, 22, 28 
<223> X can be any amino acid 

<400> 63 

Gly Glu Pro Ile Xaa Leu Arg Cys His Ser Trp Lys Asp Lys Xaa Leu 
1 5 10 15 

Xaa Lys Val Thr Tyr Xaa Gln Asn Gly Lys Ala Xaa Lys Phe Phe His 
20 25 30 



<210> 64 

<211> 17 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<221> VARIANT 
<222> 1 
<223> X can be either Glu or Asp 

<221> VARIANT 
<222> 7 
<223> X can be either Leu or Ile 

<221> VARIANT 
<222> 17 
<223> X can be either Leu or Ile 

<221> VARIANT 
<222> 2-3, 5-6, 8-13, 15-16 
<223> X can be any amino acid 

<400> 64 

Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa 
1 5 10 15 

Xaa 



<210> 65 

<211> 18 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<221> VARIANT 
<222> 1 
<223> X can be either Glu or Asp 

<221> VARIANT 
<222> 7 
<223> X can be either Leu or Ile 

<221> VARIANT 
<222> 18 
<223> X can be either Leu or Ile 

<221> VARIANT 
<222> 2-3, 5-6, 8-14, 16-17 
<223> X can be any amino acid 

<400> 65 

Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 

1 5 10 15 

Xaa Xaa 



<210> 66 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<221> VARIANT 
<222> 1 

<223> X can be either Glu or Asp 

<221> VARIANT 
<222> 7 

<223> X can be either Leu or Ile 

<221> VARIANT 
<222> 19 
<223> X can be either Leu or Ile 

<221> VARIANT 

<222> 2-3, 5-6, 8-15, 17-18 
<223> X can be any amino acid 

<400> 66 

Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 10 15 

Tyr Xaa Xaa 



<210> 67 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<221> VARIANT 
<222> 1 

<223> X can be either Ile or Val or Leu or Ser 

<221> VARIANT 
<222> 2, 4-5 
<223> X can be any amino acid 

<221> VARIANT 
<222> 6 
<223> X can be Leu or Val 

<400> 67 

Xaa Xaa Tyr Xaa Xaa Xaa 
1 5 



<210> 68 
<211> 492 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 68 

Asp Trp Leu Ser Ile Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln 
1 5 10 15 

Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Ile Lys Arg Leu 
20 25 30 

Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser 
35 40 45 

Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser 
50 55 60 

Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu 
65 70 75 80 

Thr Gly Ser Lys Trp Leu Asn Val Gln Glu Leu Phe Pro Ala Pro Gly 
85 90 95 

Leu Thr Ala Ser Pro Leu Gln Pro Val Glu Gly Ser Ser Val Thr Leu 
100 105 110 

Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg 
115 120 125 

Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser 
130 135 140 

Ser Lys Phe Thr Ile Ser Ala Ile Ser Lys Glu Asp Ser Gly Asn Tyr 
145 150 155 160 

Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His 
165 170 175 

Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Val Thr Met 
180 185 190 

Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val 
195 200 205 

Val Glu Gly Glu Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr 
210 215 220 

Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Val 
225 230 235 240 

Gly Lys Lys Ser Gln Arg Ser Gln Arg Val Glu Leu Glu Ile Pro Thr 
245 250 255 

Ile Arg Glu Ser His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn 
260 265 270 

Tyr Gly Leu Ile Gln Ser Ala Ile Val Asn Ile Thr Val Lys Ile Pro 
275 280 285 

Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe 
290 295 300 

Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg Ala Ser Pro 
305 310 315 320 

Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr 
325 330 335 

Ser Ala Pro Phe Gly Gly Lys Ala Ser Phe Lys Leu Ser Leu Thr Ala 
340 345 350 

Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr 
355 360 365 

Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Glu Pro Pro Pro Lys 
370 375 380 

Val Arg Leu Val Asn Gly Pro His His Cys Glu Gly Arg Val Glu Val 
385 390 395 400 

Glu Gln Glu Gly Arg Trp Gly Thr Val Cys Asp Asp Gly Trp Asp Met 
405 410 415 

Arg Asp Val Ala Val Val Cys Arg Glu Leu Gly Cys Gly Ala Ala Gln 
420 425 430 

His Thr Pro Ile Ala Met Leu Tyr Pro Pro Ala Val Asp Glu Ala Leu 
435 440 445 

Pro Val Leu Ile Gln Val Ala Leu Cys Asn Gly Thr Glu Lys Thr Leu 
450 455 460 

Ala Glu Cys Asp Gln Val Glu Ala Phe Asp Cys Gly His Asp Glu Asp 
465 470 475 480 

Ala Gly Ala Val Cys Glu Val Leu Pro Ser Thr Phe 
485 490 



<210> 69 
<211> 17 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 69 

Met Pro Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gln 

1 5 10 15 

Ser 



<210> 70 
<211> 383 
<212> PRT 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 70 

Asp Trp Leu Ser Ile Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln 
1 5 10 15 

Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Ile Lys Arg Leu 
20 25 30 

Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser 
35 40 45 

Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser 
50 55 60 

Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu 
65 70 75 80 

Thr Gly Ser Lys Trp Leu Asn Val Gln Glu Leu Phe Pro Ala Pro Gly 
85 90 95 

Leu Thr Ala Ser Pro Leu Gln Pro Val Glu Gly Ser Ser Val Thr Leu 
100 105 110 

Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg 
115 120 125 

Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser 
130 135 140 

Ser Lys Phe Thr Ile Ser Ala Ile Ser Lys Glu Asp Ser Gly Asn Tyr 
145 150 155 160 

Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His 
165 170 175 

Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Val Thr Met 
180 185 190 

Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val 
195 200 205 

Val Glu Gly Glu Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr 
210 215 220 

Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Val 
225 230 235 240 

Gly Lys Lys Ser Gln Arg Ser Gln Arg Val Glu Leu Glu Ile Pro Thr 
245 250 255 

Ile Arg Glu Gly His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn 
260 265 270 

Tyr Gly Leu Ile Gln Ser Ala Ile Val Asn Ile Thr Val Lys Ile Pro 
275 280 285 

Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe 
290 295 300 

Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg Ala Ser Pro 
305 310 315 320 

Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr 
325 330 335 

Ser Ala Pro Phe Gly Gly Lys Ala Ser Phe Lys Leu Ser Leu Thr Ala 
340 345 350 

Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr 
355 360 365 

Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Gly Arg Thr Ile 
370 375 380 

<210> 71 
<211> 17 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

uTlr^Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gin 

1 5 10 

Ser 






<210> 72 
<211> 17 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 72 

Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro 

1 5 10 15 

Ala 



<210> 73 
<211> 326 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 73 



Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val
1 5 10 15

Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr
20 25 30

Gly Asn Ile Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu
35 40 45

Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp
50 55 60

Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly
65 70 75 80

His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro
85 90 95

Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val
100 105 110

Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro
115 120 125

Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser
130 135 140

Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala
145 150 155 160

Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala
165 170 175

Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Leu Ser Leu Val
180 185 190

Pro Thr Glu Asn Gly Ile Ser His Leu Ser Leu Gly Leu Thr Gly Trp
195 200 205

Leu Leu Gly Cys Leu Ser Pro Ile Thr Met Ala Leu Ile Phe Cys Tyr
210 215 220

Trp Leu Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser
225 230 235 240

Pro Pro Gln Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp
245 250 255

Ser Arg Gln Pro Glu Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly
260 265 270

Asn Glu Val Tyr Ser Leu Val Tyr His Thr Pro Gln Val Leu Glu Pro
275 280 285

Ala Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln
290 295 300

Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met
305 310 315 320

Asp Tyr Glu Asp Ala Met
325



<210> 74 
<211> 203 
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 74
Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val
1 5 10 15

Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr
20 25 30

Gly Asn Ile Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu
35 40 45

Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp
50 55 60

Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly
65 70 75 80

His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro
85 90 95

Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val
100 105 110

Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro
115 120 125

Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser
130 135 140

Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala
145 150 155 160

Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala
165 170 175

Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Leu Ser Leu Val
180 185 190

Pro Thr Glu Asn Gly Ile Ser His Leu Ser Leu
195 200

<210> 75 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 75

Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro
1 5 10 15

Ala 



<210> 76 
<211> 100 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 76
Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro
1 5 10 15

Gln Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg
20 25 30

Gln Pro Glu Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly Asn Glu
35 40 45

Val Tyr Ser Leu Val Tyr His Thr Pro Gln Val Leu Glu Pro Ala Ala
50 55 60

Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser
65 70 75 80

Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met Asp Tyr
85 90 95

Glu Asp Ala Met
100



























<210> 77 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 77
Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val
1 5 10 15

Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr
20 25 30

Gly Asn Ile Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu
35 40 45

Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp
50 55 60

Met Lys Gln Ser Asp Ala Asn Gln Tyr Tyr Cys Ala Ala Asn Asp Gly
65 70 75 80

His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro
85 90 95

Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val
100 105 110

Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro
115 120 125

Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser
130 135 140

Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala
145 150 155 160

Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala
165 170 175

Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Arg Gln Ser Glu
180 185 190

Asp Pro Val Arg Ser Pro Pro Gln Thr Val Leu Gln Gly Ser Thr Tyr
195 200 205

Pro Lys Ser Pro Asp Ser Arg Gln Pro Glu Pro Leu Tyr Glu Asn Val
210 215 220

Asn Val Val Ser Gly Asn Glu Val Tyr Ser Leu Val Tyr His Thr Pro
225 230 235 240

Gln Val Leu Glu Pro Ala Ala Ala Gln His Val Arg Thr His Gly Val
245 250 255

Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile
260 265 270

Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met
275 280













<210> 78
<211> 570
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 78
Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro
1 5 10 15

Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg
20 25 30

Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys
35 40 45

Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys Ala His Thr Leu
50 55 60

Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln Ala Asp Ser Leu Leu
65 70 75 80

Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu
85 90 95

Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys
100 105 110

Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp
115 120 125

Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Ile His His
130 135 140

Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys
145 150 155 160

Trp Ser Phe Gly Ser Leu Tyr Thr Ser Asn Thr Val Gly Val Gln Val
165 170 175

Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro
180 185 190

Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala
195 200 205

Gln Lys Ser Asp Ala Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln
210 215 220

Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His Ile Pro Ala
225 230 235 240

Ile Trp Thr Glu Glu Ser Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val
245 250 255

Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln
260 265 270

Arg Ala Ser Ala Arg Phe Gln Thr His Ile Ile Pro Ala Ser Lys Leu
275 280 285

Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val
290 295 300

Pro Gly Pro Leu Lys Phe Ser Trp Tyr Lys Lys Asp Met Leu Asn Glu
305 310 315 320

Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys Ile Ser Gln
325 330 335

Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser
340 345 350

Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro
355 360 365

Val Ser Gln Pro Val Leu Thr Leu Ser Thr Gly Lys Thr Gln Ala Leu
370 375 380

Glu Gly Asp Leu Met Thr Leu His Cys Gln Ser Gln Arg Gly Ser Pro
385 390 395 400

Cys Ile Leu Tyr Glu Phe Phe Tyr Glu Asn Val Ser Leu Gly Asn Ser
405 410 415

Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr
420 425 430

Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala
435 440 445

Gln Cys Ser Glu Ala Ile Arg Ile Ser Ile Phe Asp Met Thr Lys Asn
450 455 460

Arg Ser Val Pro Met Ala Ala Gly Ile Thr Val Gly Leu Leu Ile Met
465 470 475 480

Ala Val Gly Val Phe Leu Phe Tyr Cys Trp Phe Ser Arg Lys Ala Gly
485 490 495

Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro Ser Asp Ser Glu Pro
500 505 510

Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys Ile Glu Leu Gln Pro
515 520 525

Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile Tyr Thr Glu Val Arg
530 535 540

Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln Glu Ser Glu Ser Pro
545 550 555 560

Arg Ser Arg Cys Gln Met Ala Glu Lys Lys
565 570

<210> 79 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 79 

Met Ser Gly Ser Phe Ser Pro Cys Val Val Phe Thr Gln Met Trp Leu
1 5 10 15

Thr Leu Leu Val Val Thr Pro Val Asn 

20 25 

<210> 80 
<211> 468 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 80 




























Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro
1 5 10 15

Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg
20 25 30

Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys
35 40 45

Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys Ala His Thr Leu
50 55 60

Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln Ala Asp Ser Leu Leu
65 70 75 80

Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu
85 90 95

Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys
100 105 110

Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp
115 120 125

Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Ile His His
130 135 140

Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys
145 150 155 160

Trp Ser Phe Gly Ser Leu Tyr Thr Ser Asn Thr Val Gly Val Gln Val
165 170 175

Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro
180 185 190

Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala
195 200 205

Gln Lys Ser Asp Ala Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln
210 215 220

Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His Ile Pro Ala
225 230 235 240

Ile Trp Thr Glu Glu Ser Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val
245 250 255

Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln
260 265 270

Arg Ala Ser Ala Arg Phe Gln Thr His Ile Ile Pro Ala Ser Lys Leu
275 280 285

Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val
290 295 300

Pro Gly Pro Leu Lys Phe Ser Trp Tyr Lys Lys Asp Met Leu Asn Glu
305 310 315 320

Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys Ile Ser Gln
325 330 335

Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser
340 345 350

Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro
355 360 365

Val Ser Gln Pro Val Leu Thr Leu Ser Thr Gly Lys Thr Gln Ala Leu
370 375 380

Glu Gly Asp Leu Met Thr Leu His Cys Gln Ser Gln Arg Gly Ser Pro
385 390 395 400

Cys Ile Leu Tyr Glu Phe Phe Tyr Glu Asn Val Ser Leu Gly Asn Ser
405 410 415

Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr
420 425 430

Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala
435 440 445

Gln Cys Ser Glu Ala Ile Arg Ile Ser Ile Phe Asp Met Thr Lys Asn
450 455 460

Arg Ser Val Pro
465

<210> 81 
<211> 79 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 81
Ser Arg Lys Ala Gly Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro
1 5 10 15

Ser Asp Ser Glu Pro Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys
20 25 30

Ile Glu Leu Gln Pro Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile
35 40 45

Tyr Thr Glu Val Arg Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln
50 55 60

Glu Ser Glu Ser Pro Arg Ser Arg Cys Gln Met Ala Glu Lys Lys
65 70 75

<210> 82 
<211> 1973 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 82 

ccacagtgtt ctatcccaga tccgtggtcc atctgcccta aggacttgag ctgcacctgt 60
ctcaaaggga gctacttgcc tctagtctca tgcctctgtg cttgctgctt ctggtcttcg 120
ctcctgtcgg agtccagtcc gactggttga gcatcagcct tccacaccgt tcttatgaag 180
gagaccaagt agttataagc tgcacaggaa aaaataatgg tgacataaag agactgaagt 240
acttcaagga tggatatcac atagaaactt acagcagtgc ttcaagctac accattagga 300
atgcaagacg tggtgacagt ggctcctatt cctgtaaggc agataggaaa tttttcctat 360
ttatagacac aacagaagaa acaggatcta agtggctgaa tgtccaagag ctgtttccag 420
cacctgggct gacagccagc cccctgcagc ccgtagaggg gagttcagtg accctgtcct 480
gcaacacctg gctcccttca gatagggcaa cgacccagct acgctattcc ttcttcaaag 540
atggccacac tttgcaatcg ggctggacct catcaaaatt taccatctca gcaatatcga 600
aggaagactc aggaaattac tggtgtgaag caatgactgc ctctcgcagt gtctcaaagc 660
agagtcaccg gtcctacata gatgtagaga ggatccctgt atctcaagtc accatggaaa 720
tccagccttc aaggggctgg ggagttgaag gggagccact ggtcgttgaa ggggagcccc 780
tggtcctggc ttgttctgtg gctaaaggca ccgggctaat cacgttctcc tggcataggc 840
aggacactaa ggaaagtgtg gggaagaaaa gtcagcgttc ccagagagtg gagctggaga 900
tccctactat cagggaaggc catgctgggg ggtactactg cacagcagac aacaactacg 960
gcctgatcca gagcgcaatc gtgaacatca ccgtgaaaat tccagtgttg aacccgctcc 1020
tctccatcag tgttcctggg gtcttgccct tcatcggaga tgtggcggag cttcactgtg 1080
aagacaagag agcatctcct ccggttctct actggtttta tcatgaaaat atcactctgg 1140
ctaacacctc ggcacctttt ggaggaaagg catcctttaa gctctctctg actgcagggc 1200
attctgggaa ctactcttgt gaggctgaaa acgcctgggg taccaagcgc agtgaggtgg 1260
taacgctcaa tgtcacagag cccccaccca aagtgcgttt ggtgaatggc ccccaccact 1320
gtgaaggacg cgtagaggtg gagcaggaag gtcgctgggg cactgtatgt gatgatggct 1380
gggacatgag ggatgtggct gtggtgtgcc gagagctggg ctgtggagca gcccaacaca 1440
cacctatagc catgctgtat ccaccagcag ttgatgaagc tctgcctgtg ctcattcagg 1500
tagccctgtg caatggcaca gaaaagaccc tggctgaatg tgaccaggtt gaggcctttg 1560
attgtggaca tgatgaggat gctggagctg tgtgtgaagt cttacccagc actttctgaa 1620
gatctagaga ccagagacca tcagacctcc tactttctgc actgggcctc acagccctca 1680
cggtctgcag ctcccagtgg acttccagac ttcagctgtg gcttatcctt caagaggact 1740
cgaaactata ttaatctgct ctgagataat gttccaacag ctccaaagaa agcccgagtc 1800
ccttgtcccc agaggccaag cttggaaaaa ttgttcccct gtccaggttc cctgcctttc 1860
tagttccttc ttgctatctc cttgggcaga tgcagaggtg gcacaagtaa ggatcacata 1920
catgtgcctg ggcttccatc tggtagaatg tggtctaaca aagcacatac aac 1973



<210> 83 
<211> 1530 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 83
atgcctctgt gcttgctgct tctggtcttc gctcctgtcg gagtccagtc cgactggttg 60
agcatcagcc ttccacaccg ttcttatgaa ggagaccaag tagttataag ctgcacagga 120
aaaaataatg gtgacataaa gagactgaag tacttcaagg atggatatca catagaaact 180
tacagcagtg cttcaagcta caccattagg aatgcaagac gtggtgacag tggctcctat 240
tcctgtaagg cagataggaa atttttccta tttatagaca caacagaaga aacaggatct 300
aagtggctga atgtccaaga gctgtttcca gcacctgggc tgacagccag ccccctgcag 360
cccgtagagg ggagttcagt gaccctgtcc tgcaacacct ggctcccttc agatagggca 420
acgacccagc tacgctattc cttcttcaaa gatggccaca ctttgcaatc gggctggacc 480
tcatcaaaat ttaccatctc agcaatatcg aaggaagact caggaaatta ctggtgtgaa 540
gcaatgactg cctctcgcag tgtctcaaag cagagtcacc ggtcctacat agatgtagag 600
aggatccctg tatctcaagt caccatggaa atccagcctt caaggggctg gggagttgaa 660
ggggagccac tggtcgttga aggggagccc ctggtcctgg cttgttctgt ggctaaaggc 720
accgggctaa tcacgttctc ctggcatagg caggacacta aggaaagtgt ggggaagaaa 780
agtcagcgtt cccagagagt ggagctggag atccctacta tcagggaagg ccatgctggg 840
gggtactact gcacagcaga caacaactac ggcctgatcc agagcgcaat cgtgaacatc 900
accgtgaaaa ttccagtgtt gaacccgctc ctctccatca gtgttcctgg ggtcttgccc 960
ttcatcggag atgtggcgga gcttcactgt gaagacaaga gagcatctcc tccggttctc 1020
tactggtttt atcatgaaaa tatcactctg gctaacacct cggcaccttt tggaggaaag 1080
gcatccttta agctctctct gactgcaggg cattctggga actactcttg tgaggctgaa 1140
aacgcctggg gtaccaagcg cagtgaggtg gtaacgctca atgtcacaga gcccccaccc 1200
aaagtgcgtt tggtgaatgg cccccaccac tgtgaaggac gcgtagaggt ggagcaggaa 1260
ggtcgctggg gcactgtatg tgatgatggc tgggacatga gggatgtggc tgtggtgtgc 1320
cgagagctgg gctgtggagc agcccaacac acacctatag ccatgctgta tccaccagca 1380
gttgatgaag ctctgcctgt gctcattcag gtagccctgt gcaatggcac agaaaagacc 1440
ctggctgaat gtgaccaggt tgaggccttt gattgtggac atgatgagga tgctggagct 1500
gtgtgtgaag tcttacccag cactttctga 1530



<210> 84 
<211> 1371 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 84
ccacagtgtt ctatcccaga tccgtggtcc atctgcccta aggacttgag ctgcacctgt 60
ctcaaaggga gctacttgcc tctagtctca tgcctctgtg cttgctgctt ctggtcttcg 120
ctcctgtcgg agtccagtcc gactggttga gcatcagcct tccacaccgt tcttatgaag 180
gagaccaagt agttataagc tgcacaggaa aaaataatgg tgacataaag agactgaagt 240
acttcaagga tggatatcac atagaaactt acagcagtgc ttcaagctac accattagga 300
atgcaagacg tggtgacagt ggctcctatt cctgtaaggc agataggaaa tttttcctat 360
ttatagacac aacagaagaa acaggatcta agtggctgaa tgtccaagag ctgtttccag 420
cacctgggct gacagccagc cccctgcagc ccgtagaggg gagttcagtg accctgtcct 480
gcaacacctg gctcccttca gatagggcaa cgacccagct acgctattcc ttcttcaaag 540
atggccacac tttgcaatcg ggctggacct catcaaaatt taccatctca gcaatatcga 600
aggaagactc aggaaattac tggtgtgaag caatgactgc ctctcgcagt gtctcaaagc 660
agagtcaccg gtcctacata gatgtagaga ggatccctgt atctcaagtc accatggaaa 720
tccagccttc aaggggctgg ggagttgaag gggagccact ggtcgttgaa ggggagcccc 780
tggtcctggc ttgttctgtg gctaaaggca ccgggctaat cacgttctcc tggcataggc 840
aggacactaa ggaaagtgtg gggaagaaaa gtcagcgttc ccagagagtg gagctggaga 900
tccctactat cagggaaggc catgctgggg ggtactactg cacagcagac aacaactacg 960
gcctgatcca gagcgcaatc gtgaacatca ccgtgaaaat tccagtgttg aacccgctcc 1020
tctccatcag tgttcctggg gtcttgccct tcatcggaga tgtggcggag cttcactgtg 1080
aagacaagag agcatctcct ccggttctct actggtttta tcatgaaaat atcactctgg 1140
ctaacacctc ggcacctttt ggaggaaagg catcctttaa gctctctctg actgcagggc 1200
attctgggaa ctactcttgt gaggctgaaa acgcctgggg taccaagcgc agtgaggtgg 1260
taacgctcaa tgtcacaggt aggacaattt aatgatccat tccagggtgc aacttgcctt 1320
ctggccatgc ccttcttctc tcccttgcac ctgtacctct tggtctttga a 1371

<210> 85
<211> 1203 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 85 

atgcctctgt gcttgctgct tctggtcttc gctcctgtcg gagtccagtc cgactggttg 60 

agcatcagcc ttccacaccg ttcttatgaa ggagaccaag tagttataag ctgcacagga 120 

aaaaataatg gtgacataaa gagactgaag tacttcaagg atggatatca catagaaact 180 

tacagcagtg cttcaagcta caccattagg aatgcaagac gtggtgacag tggctcctat 240 

tcctgtaagg cagataggaa atttttccta tttatagaca caacagaaga aacaggatct 300 

aagtggctga atgtccaaga gctgtttcca gcacctgggc tgacagccag ccccctgcag 360 

cccgtagagg ggagttcagt gaccctgtcc tgcaacacct ggctcccttc agatagggca 420 

acgacccagc tacgctattc cttcttcaaa gatggccaca ctttgcaatc gggctggacc 480

tcatcaaaat ttaccatctc agcaatatcg aaggaagact caggaaatta ctggtgtgaa 540 

gcaatgactg cctctcgcag tgtctcaaag cagagtcacc ggtcctacat agatgtagag 600 

aggatccctg tatctcaagt caccatggaa atccagcctt caaggggctg gggagttgaa 660 

ggggagccac tggtcgttga aggggagccc ctggtcctgg cttgttctgt ggctaaaggc 720 

accgggctaa tcacgttctc ctggcatagg caggacacta aggaaagtgt ggggaagaaa 780 

agtcagcgtt cccagagagt ggagctggag atccctacta tcagggaagg ccatgctggg 840 

gggtactact gcacagcaga caacaactac ggcctgatcc agagcgcaat cgtgaacatc 900 

accgtgaaaa ttccagtgtt gaacccgctc ctctccatca gtgttcctgg ggtcttgccc 960 

ttcatcggag atgtggcgga gcttcactgt gaagacaaga gagcatctcc tccggttctc 1020 

tactggtttt atcatgaaaa tatcactctg gctaacacct cggcaccttt tggaggaaag 1080 

gcatccttta agctctctct gactgcaggg cattctggga actactcttg tgaggctgaa 1140 

aacgcctggg gtaccaagcg cagtgaggtg gtaacgctca atgtcacagg taggacaatt 1200 

taa 1203 



<210> 86 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 86 

gactggttga gcatcagcct tccacaccgt tcttatgaag gagaccaagt agttataagc 60 

tgcacaggaa aaaataatgg tgacataaag agactgaagt acttcaagga tggatatcac 120 

atagaaactt acagcagtgc ttcaagctac accattagga atgcaagacg tggtgacagt 180 

ggctcctatt cctgtaaggc agataggaaa tttttcctat ttatagacac aacagaagaa 240 

acaggatcta agtggctgaa tgtccaagag ctgtttccag cacctgggct gacagccagc 300 

cccctgcagc ccgtagaggg gagttcagtg accctgtcct gcaacacctg gctcccttca 360 

gatagggcaa cgacccagct acgctattcc ttcttcaaag atggccacac tttgcaatcg 420 

ggctggacct catcaaaatt taccatctca gcaatatcga aggaagactc aggaaattac 480 

tggtgtgaag caatgactgc ctctcgcagt gtctcaaagc agagtcaccg gtcctacata 540 

gatgtagaga ggatccctgt atctcaagtc accatggaaa tccagccttc aaggggctgg 600 

ggagttgaag gggagccact ggtcgttgaa ggggagcccc tggtcctggc ttgttctgtg 660 

gctaaaggca ccgggctaat cacgttctcc tggcataggc aggacactaa ggaaagtgtg 720 

gggaagaaaa gtcagcgttc ccagagagtg gagctggaga tccctactat cagggaaggc 780 

catgctgggg ggtactactg cacagcagac aacaactacg gcctgatcca gagcgcaatc 840 

gtgaacatca ccgtgaaaat tccagtgttg aacccgctcc tctccatcag tgttcctggg 900 

gtcttgccct tcatcggaga tgtggcggag cttcactgtg aagacaagag agcatctcct 960 

ccggttctct actggtttta tcatgaaaat atcactctgg ctaacacctc ggcacctttt 1020 

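ggaggaaagg catcctttaa gctctctctg actgcagggc attctgggaa ctactcttgt 1080

gaggctgaaa acgcctgggg taccaagcgc agtgaggtgg taacgctcaa tgtcacagag 1140

cccccaccca aagtgcgttt ggtgaatggc ccccaccact gtgaaggacg cgtagaggtg 1200

gagcaggaag gtcgctgggg cactgtatgt gatgatggct gggacatgag ggatgtggct 1260

gtggtgtgcc gagagctggg ctgtggagca gcccaacaca cacctatagc catgctgtat 1320

ccaccagcag ttgatgaagc tctgcctgtg ctcattcagg tagccctgtg caatggcaca 1380

gaaaagaccc tggctgaatg tgaccaggtt gaggcctttg attgtggaca tgatgaggat 1440

gctggagctg tgtgtgaagt cttacccagc actttctga 1479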

<210> 87 
<211> 1152 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 87
gactggttga gcatcagcct tccacaccgt tcttatgaag gagaccaagt agttataagc 60
tgcacaggaa aaaataatgg tgacataaag agactgaagt acttcaagga tggatatcac 120
atagaaactt acagcagtgc ttcaagctac accattagga atgcaagacg tggtgacagt 180
ggctcctatt cctgtaaggc agataggaaa tttttcctat ttatagacac aacagaagaa 240
acaggatcta agtggctgaa tgtccaagag ctgtttccag cacctgggct gacagccagc 300
cccctgcagc ccgtagaggg gagttcagtg accctgtcct gcaacacctg gctcccttca 360
gatagggcaa cgacccagct acgctattcc ttcttcaaag atggccacac tttgcaatcg 420
ggctggacct catcaaaatt taccatctca gcaatatcga aggaagactc aggaaattac 480
tggtgtgaag caatgactgc ctctcgcagt gtctcaaagc agagtcaccg gtcctacata 540
gatgtagaga ggatccctgt atctcaagtc accatggaaa tccagccttc aaggggctgg 600
ggagttgaag gggagccact ggtcgttgaa ggggagcccc tggtcctggc ttgttctgtg 660
gctaaaggca ccgggctaat cacgttctcc tggcataggc aggacactaa ggaaagtgtg 720
gggaagaaaa gtcagcgttc ccagagagtg gagctggaga tccctactat cagggaaggc 780
catgctgggg ggtactactg cacagcagac aacaactacg gcctgatcca gagcgcaatc 840
gtgaacatca ccgtgaaaat tccagtgttg aacccgctcc tctccatcag tgttcctggg 900
gtcttgccct tcatcggaga tgtggcggag cttcactgtg aagacaagag agcatctcct 960
ccggttctct actggtttta tcatgaaaat atcactctgg ctaacacctc ggcacctttt 1020
ggaggaaagg catcctttaa gctctctctg actgcagggc attctgggaa ctactcttgt 1080
gaggctgaaa acgcctgggg taccaagcgc agtgaggtgg taacgctcaa tgtcacaggt 1140
aggacaattt aa 1152

<210> 88 
<211> 1567 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 88
taagtagtct ggttttgctt ttttttcttt tggtgagagg taccttcaag ttaccatccc 60
cKccJggtc Scatgctac cttggctcct gctactgatc tgtgctctac cgtgtgaacc 120
tgctggaatc tctgatgtga gcttgaagac acggccccca ggaggatggg tgatggaggg 180
agacaagctg gtcctcatct gctcggttga tagagtcact gggaatataa cttacttctg 240
gtacagaggg gccctgggtt tccaactgga aacaaagaca caaccttcac taacagcaga 300
gtttgagatc agtgacatga agcagagcga tgctgatcaa tattactgtg cggctaacga 360
tggccacgac cctatcgcca gtgagctggt gagcatccac gtcagagttc cagtgtctcg 420
ccctgtcctt acgtttgggg actctggaac ccaggctgtg ctaggggacc tggtggagct 480
tcactgtaag gccctgagag gctcaccccc aatcttctac cagttttatc atgagagcat 540
catcctgggg aacagttcag caccctctgg aggaggagca tccttcaact tctccctgac 600
tgcagaacat tctggaaact tctcctgtga ggccagcaat ggacagggtg cccaacgaag 660
tgaggtggtg gctctcaact taacaggtct ctccttagtg cctactgaga atggaatcag 720
ccatctctcc ttaggactca ctgggtggct gcttggctgt cttagcccca tcaccatggc 780
cttaatattt tgctactggc tcaagagaaa aataggaaga cagtcagagg atccagtcag 840

gagccctcct cagactgtgc tccaaggatc cacgtacccc aaatcccccg actcaaggca 900 

gccagagccc ctgtatgaga acgtgaacgt tgtaagtggc aatgaagtgt actctctggt 960 

gtaccacacc ccgcaggtgc tggaaccagc agcagctcag catgtgagga cacacggagt 1020 

aagtgagtcc tttcaggtct cctctggact ctattctaag ccaaggataa acattgcaca 1080 

tatggactat gaagacgcca tgtagaatta tgtaaacagc aactatggag tgctacatac 1140 

aagcccaagg cctgatgtgg cctccaagga tactggggac agggatagct tgccagccca 1200 

atttccccac acactgcggt tcattagatg agtccttcac ctaccctgtg tgaagctgga 1260 

gcaagtcctg cagaaaccac ccaggaaaac caacttagac ggagaagcca gaagcatttg 1320 

catctggttg ttgcccattc atgttggcac acgaactttt atttacagga ggaaaatggt 1380 

gtgatgaaag caactaaggt cttacagcag agggacaatg cgactcagag agcacaaagc 1440 

cgagatcaat ggctttgcag gtctgctgtg gagacagagc catgcttcct ctgtgcacat 1500 

accctagagt acttctgagt cactgccatc aacttagaat taaacacagt tgcataaaat 1560 

gtactgt 1567 



<210> 89 
<211> 1032 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 89 

atgctacctt ggctcctgct actgatctgt gctctaccgt gtgaacctgc tggaatctct 60 

gatgtgagct tgaagacacg gcccccagga ggatgggtga tggagggaga caagctggtc 120 

ctcatctgct cggttgatag agtcactggg aatataactt acttctggta cagaggggcc 180 

ctgggtttcc aactggaaac aaagacacaa ccttcactaa cagcagagtt tgagatcagt 240 

gacatgaagc agagcgatgc tgatcaatat tactgtgcgg ctaacgatgg ccacgaccct 300 

atcgccagtg agctggtgag catccacgtc agagttccag tgtctcgccc tgtccttacg 360 

tttggggact ctggaaccca ggctgtgcta ggggacctgg tggagcttca ctgtaaggcc 420 

ctgagaggct cacccccaat cttctaccag ttttatcatg agagcatcat cctggggaac 480 

agttcagcac cctctggagg aggagcatcc ttcaacttct ccctgactgc agaacattct 540 

ggaaacttct cctgtgaggc cagcaatgga cagggtgccc aacgaagtga ggtggtggct 600 

ctcaacttaa caggtctctc cttagtgcct actgagaatg gaatcagcca tctctcctta 660 

ggactcactg ggtggctgct tggctgtctt agccccatca ccatggcctt aatattttgc 720 

tactggctca agagaaaaat aggaagacag tcagaggatc cagtcaggag ccctcctcag 780 

actgtgctcc aaggatccac gtaccccaaa tcccccgact caaggcagcc agagcccctg 840 

tatgagaacg tgaacgttgt aagtggcaat gaagtgtact ctctggtgta ccacaccccg 900 

caggtgctgg aaccagcagc agctcagcat gtgaggacac acggagtaag tgagtccttt 960 

caggtctcct ctggactcta ttctaagcca aggataaaca ttgcacatat ggactatgaa 1020 

gacgccatgt ag 1032 



<210> 90 
<211> 981 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 90 

ggaatctctg atgtgagctt gaagacacgg cccccaggag gatgggtgat ggagggagac 60
aagctggtcc tcatctgctc ggttgataga gtcactggga atataactta cttctggtac 120
agaggggccc tgggtttcca actggaaaca aagacacaac cttcactaac agcagagttt 180
gagatcagtg acatgaagca gagcgatgct gatcaatatt actgtgcggc taacgatggc 240
cacgacccta tcgccagtga gctggtgagc atccacgtca gagttccagt gtctcgccct 300
gtccttacgt ttggggactc tggaacccag gctgtgctag gggacctggt ggagcttcac 360
tgtaaggccc tgagaggctc acccccaatc ttctaccagt tttatcatga gagcatcatc 420






ctggggaaca gttcagcacc ctctggagga ggagcatcct tcaacttctc cctgactgca 480
gaacattctg gaaacttctc ctgtgaggcc agcaatggac agggtgccca acgaagtgag 540
gtggtggctc tcaacttaac aggtctctcc ttagtgccta ctgagaatgg aatcagccat 600
ctctccttag gactcactgg gtggctgctt ggctgtctta gccccatcac catggcctta 660
atattttgct actggctcaa gagaaaaata ggaagacagt cagaggatcc agtcaggagc 720
cctcctcaga ctgtgctcca aggatccacg taccccaaat cccccgactc aaggcagcca 780
gagcccctgt atgagaacgt gaacgttgta agtggcaatg aagtgtactc tctggtgtac 840
cacaccccgc aggtgctgga accagcagca gctcagcatg tgaggacaca cggagtaagt 900
gagtcctttc aggtctcctc tggactctat tctaagccaa ggataaacat tgcacatatg 960
gactatgaag acgccatgta g 981



<210> 91 
<211> 660 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: /note =
synthetic construct
<400> 91
atgctacctt ggctcctgct actgatctgt gctctaccgt gtgaacctgc tggaatctct 60
gatgtgagct tgaagacacg gcccccagga ggatgggtga tggagggaga caagctggtc 120
ctcatctgct cggttgatag agtcactggg aatataactt acttctggta cagaggggcc 180
ctgggtttcc aactggaaac aaagacacaa ccttcactaa cagcagagtt tgagatcagt 240
gacatgaagc agagcgatgc tgatcaatat tactgtgcgg ctaacgatgg ccacgaccct 300
atcgccagtg agctggtgag catccacgtc agagttccag tgtctcgccc tgtccttacg 360
tttggggact ctggaaccca ggctgtgcta ggggacctgg tggagcttca ctgtaaggcc 420
ctgagaggct cacccccaat cttctaccag ttttatcatg agagcatcat cctggggaac 480
agttcagcac cctctggagg aggagcatcc ttcaacttct ccctgactgc agaacattct 540
ggaaacttct cctgtgaggc cagcaatgga cagggtgccc aacgaagtga ggtggtggct 600
ctcaacttaa caggtctctc cttagtgcct actgagaatg gaatcagcca tctctcctta 660

<210> 92
<211> 609
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: /note =
synthetic construct
<400> 92
ggaatctctg atgtgagctt gaagacacgg cccccaggag gatgggtgat ggagggagac 60
aagctggtcc tcatctgctc ggttgataga gtcactggga atataactta cttctggtac 120
agaggggccc tgggtttcca actggaaaca aagacacaac cttcactaac agcagagttt 180
gagatcagtg acatgaagca gagcgatgct gatcaatatt actgtgcggc taacgatggc 240
cacgacccta tcgccagtga gctggtgagc atccacgtca gagttccagt gtctcgccct 300
gtccttacgt ttggggactc tggaacccag gctgtgctag gggacctggt ggagcttcac 360
tgtaaggccc tgagaggctc acccccaatc ttctaccagt tttatcatga gagcatcatc 420
ctggggaaca gttcagcacc ctctggagga ggagcatcct tcaacttctc cctgactgca 480
gaacattctg gaaacttctc ctgtgaggcc agcaatggac agggtgccca acgaagtgag 540
gtggtggctc tcaacttaac aggtctctcc ttagtgccta ctgagaatgg aatcagccat 600
ctctcctta 609

<210> 93
<211> 303
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 93 

aagagaaaaa taggaagaca gtcagaggat ccagtcagga gccctcctca gactgtgctc 60 

caaggatcca cgtaccccaa atcccccgac tcaaggcagc cagagcccct gtatgagaac 120 

gtgaacgttg taagtggcaa tgaagtgtac tctctggtgt accacacccc gcaggtgctg 180 

gaaccagcag cagctcagca tgtgaggaca cacggagtaa gtgagtcctt tcaggtctcc 240 

tctggactct attctaagcc aaggataaac attgcacata tggactatga agacgccatg 300 

tag 303 

<210> 94 
<211> 1567 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 94 

tgagtagtct ggttttgctt ttttttcttt tggtgagagg taccttcaag ttaccatccc 60 

cagcctggtc ctcatgctac cttggctcct gctactgatc tgtgctctac cgtgtgaacc 120 

tgctggaatc tctgatgtga gcttgaagac acggccccca ggaggatggg tgatggaggg 180 

agacaagctg gtcctcatct gctcggttga tagagtcact gggaatataa cttacttctg 240

gtacagaggg gccctgggtt tccaactgga aacaaagaca caaccttcac taacagcaga 300 

gtttgagatc agtgacatga agcagagcga tgctgatcaa tattactgtg cggctaacga 360 

tggccacgac cctatcgcca gtgagctggt gagcatccac gtcagagttc cagtgtctcg 420 

ccctgtcctt acgtttgggg actctggaac ccaggctgtg ctaggggacc tggtggagct 480 

tcactgtaag gccctgagag gctcaccccc aatcttctac cagttttatc atgagagcat 540

catcctgggg aacagttcag caccctctgg aggaggagca tccttcaact tctccctgac 600 

tgcagaacat tctggaaact tctcctgtga ggccagcaat ggacagggtg cccaacgaag 660 

tgaggtggtg gctctcaact taacaggtct ctccttagtg cctactgaga atggaatcag 720

ccatctctcc ttaggactca ctgggtggct gcttggctgt cttagcccca tcaccatggc 780 

cttaatattt tgctactggc tcaagagaaa aataggaaga cagtcagagg atccagtcag 840 

gagccctcct cagactgtgc tccaaggatc cacgtacccc aaatcccccg actcaaggca 900 

gccagagccc ctgtatgaga acgtgaacgt tgtaagtggc aatgaagtgt actctctggt 960 

gtaccacacc ccgcaggtgc tggaaccagc agcagctcag catgtgagga cacacggagt 1020 

aagtgagtcc tttcaggtct cctctggact ctattctaag ccaaggataa acattgcaca 1080 

tatggactat gaagacgcca tgtagaatta tgtaaacagc aactatggag tgctacatac 1140 

aagcccaagg cctgatgtgg cctccaagga tactggggac agggatagct tgccagccca 1200 

atttccccac acactgcggt tcattagatg agtccttcac ctaccctgtg tgaagctgga 1260 

gcaagtcctg cagaaaccac ccaggaaaac caacttagac ggagaagcca gaagcatttg 1320

catctggttg ttgcccattc atgttggcac acgaactttt atttacagga ggaaaatggt 1380

gtgatgaaag caactaaggt cttacagcag agggacaatg cgactcagag agcacaaagc 1440 

cgagatcaat ggctttgcag gtctgctgtg gagacagagc catgcttcct ctgtgcacat 1500 

accctagagt acttctgagt cactgccatc aacttagaat taaacacagt tgcataaaat 1560 

gtactgt 1567 

<210> 95 
<211> 903 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 95 

atgctacctt ggctcctgct actgatctgt gctctaccgt gtgaacctgc tggaatctct 60 

gatgtgagct tgaagacacg gcccccagga ggatgggtga tggagggaga caagctggtc 120
ctcatctgct cggttgatag agtcactggg aatataactt acttctggta cagaggggcc 180
ctgggtttcc aactggaaac aaagacacaa ccttcactaa cagcagagtt tgagatcagt 240
gacatgaagc agagcgatgc tgatcaatat tactgtgcgg ctaacgatgg ccacgaccct 300
atcgccagtg agctggtgag catccacgtc agagttccag tgtctcgccc tgtccttacg 360
tttggggact ctggaaccca ggctgtgcta ggggacctgg tggagcttca ctgtaaggcc 420
ctgagaggct cacccccaat cttctaccag ttttatcatg agagcatcat cctggggaac 480
agttcagcac cctctggagg aggagcatcc ttcaacttct ccctgactgc agaacattct 540
ggaaacttct cctgtgaggc cagcaatgga cagggtgccc aacgaagtga ggtggtggct 600
ctcaacttaa caggaagaca gtcagaggat ccagtcagga gccctcctca gactgtgctc 660
caaggatcca cgtaccccaa atcccccgac tcaaggcagc cagagcccct gtatgagaac 720
gtgaacgttg taagtggcaa tgaagtgtac tctctggtgt accacacccc gcaggtgctg 780
gaaccagcag cagctcagca tgtgaggaca cacggagtaa gtgagtcctt tcaggtctcc 840
tctggactct attctaagcc aaggataaac attgcacata tggactatga agacgccatg 900
tag 903

<210> 96 
<211> 852 
<212> DNA 

<213> Artificial Sequence 



<220>

<223> Description of Artificial Sequence : /note =

synthetic construct

<400> 96

ggaatctctg atgtgagctt gaagacacgg cccccaggag gatgggtgat ggagggagac 60
aagctggtcc tcatctgctc ggttgataga gtcactggga atataactta cttctggtac 120
agaggggccc tgggtttcca actggaaaca aagacacaac cttcactaac agcagagttt 180
gagatcagtg acatgaagca gagcgatgct gatcaatatt actgtgcggc taacgatggc 240
cacgacccta tcgccagtga gctggtgagc atccacgtca gagttccagt gtctcgccct 300
gtccttacgt ttggggactc tggaacccag gctgtgctag gggacctggt ggagcttcac 360
tgtaaggccc tgagaggctc acccccaatc ttctaccagt tttatcatga gagcatcatc 420
ctggggaaca gttcagcacc ctctggagga ggagcatcct tcaacttctc cctgactgca 480
gaacattctg gaaacttctc ctgtgaggcc agcaatggac agggtgccca acgaagtgag 540
gtggtggctc tcaacttaac aggaagacag tcagaggatc cagtcaggag ccctcctcag 600
actgtgctcc aaggatccac gtaccccaaa tcccccgact caaggcagcc agagcccctg 660
tatgagaacg tgaacgttgt aagtggcaat gaagtgtact ctctggtgta ccacaccccg 720
caggtgctgg aaccagcagc agctcagcat gtgaggacac acggagtaag tgagtccttt 780
caggtctcct ctggactcta ttctaagcca aggataaaca ttgcacatat ggactatgaa 840
gacgccatgt ag 852

<210> 97 
<211> 2447 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence : /note =

synthetic construct

<400> 97

attcaagtta cactcaactg ttttagaaga gcagttcccc agatttctcc ttggagctgt 60
gagtgactac cattgcgagc aagagcaaga ggaaagcact acctgtgagc agatgtctgg 120
ttcattctca ccctgtgtgg tgttcacaca gatgtggctg actctactgg ttgtgactcc 180
tgtcaatgga cagcatgaag ctgcacagca gtctgtggtt tcccttcagc ctccatggac 240
cactttcttt cgaggagagg tcgtcacact gacttgttat agattcggct tctccgtacc 300
ccagaaaaca aaatggtacc agaaaagaaa aacagtgaag caaaccccag gtgctttggt 360
aattaaagca cataccttaa aggtccatga gtccggagag tattggtgcc aagccgacag 420
cttacttccg agcatgcacg tgaacgtaga gttttctgaa gattttctgg tgctgcaagc 480
tccacctgct gtgtttgaag gagactctgt ggttctgagg tgctacgcaa agaaaggcat 540
agaagcagag accctgacat tttacaagga tggtaaagct ctgacattac atcatcaaag 600
tgagctctct attcatcatg caaatctgaa ggacaacggt caatacaaat gcacttcgaa 660

gaagaagtgg tcttttgggt ccctctatac ttccaatacg gtcggagttc aagtccaaga 720 

gttgttccca cggcctgtgc tgagagccag accctcccat cccatagatg gaagtccagt 780 

gaccctgacg tgtcagaccc agctctctgc acagaagtca gatgcccggc tccagttctg 840 

tttcttcaga aacctccagc ttctggggtc aggctgcagc cgctcctcag agtttcacat 900 

tcctgccata tggactgaag agtcaaggag ataccagtgc aaggcagaaa cagtgaattc 960 

ccaagttaga aaacaaagta cagcgttcat aatcccagtg cagagagctt ctgcgagatt 1020 

ccaaacacac atcatcccag cctcaaagtt ggtgtttgaa gggcagttgc tgttactcaa 1080 

ctgctcagta aaaggagtyc caggrcccct caaattctcc tggtataaaa aggacatgct 1140 

gaatgaagaa acaaagattc ttaagtcctc caacgcagaa ttcaagatct cccaggtgaa 1200 

catcagtgac gcaggggagt atcactgtga agctaccaac agccgccgaa gctttgtcag 1260 

cagggcattt cccatcacca taaaagtccc agtatctcaa ccagttctca ccctaagcac 1320 

aggcaagacc caggcccttg agggagactt gatgacactt cattgtcaat cccagagggg 1380 

ctctccatgt atcctgtatg aattcttcta tgagaatgtc tccctgggga atagctctat 1440

actctctgga ggaggagcat acttcaattt ctctatgagc acagagcgat ctggaaacta 1500 

ctactgcaca gcagacaatg gcctgggagc ccagtgcagt gaagctataa ggatctctat 1560

ctttgacatg acaaagaaca gaagtgttcc tatggctgcc ggaatcactg tgggactgct 1620 

catcatggct gttggagtgt ttctgtttta ttgctggttc tctagaaaag caggaggaaa 1680 

gcctacctct gatgactcca gaaacccttc agattcagaa ccccaggagc ccacctatta 1740 

caacgtacca gcctgtatag aactgcagcc agtgtacagc aatgagcctg aggaaaacgt 1800 

gatttacaca gaagtacgga gaactcaacc aagacagaaa catgcagatc aggagtctga 1860

aagcccaaga tcaaggtgcc agatggctga gaaaaagtag gatatgtctc ctccaagaac 1920 

agctccagaa aagaaacccg aagcttcgtc agtctaatct caccgatgct tctactgggc 1980 

ctgcactttc ctacccacgg atggctccac agatcatgga cagcaaggaa atggccaact 2040 

ctcctaagac tgggccaaca tccccatctt ctctttggtt tcccagagcc acgccacccc 2100 

aaagtcagca ggaagttgca aaagatcaca acgaccctat tcctgttttg taaccacccc 2160 

cagcctgaag caggctgagc cagaccttga ccttgctgcc actaaggaga ttacctaggg 2220 

tggagcctgc ctctctagat cactctattg ttcagccact gccactgttc tccttcaaga 2280

cactgctacc tgctgggagg ccactgagct attccagaga ctacacccta tcctgcacat 2340 

catcacctgt agcctgttcc aggctccaag aatgaattgg cggcaatggg cctcccccct 2400 

acccccttta taagtgcatt tgccattaaa catttgggct ttgatct 2447 



<210> 98 
<211> 1788 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 98 

atgtctggtt cattctcacc ctgtgtggtg ttcacacaga tgtggctgac tctactggtt 60 

gtgactcctg tcaatggaca gcatgaagct gcacagcagt ctgtggtttc ccttcagcct 120 

ccatggacca ctttctttcg aggagaggtc gtcacactga cttgttatag attcggcttc 180 

tccgtacccc agaaaacaaa atggtaccag aaaagaaaaa cagtgaagca aaccccaggt 240

gctttggtaa ttaaagcaca taccttaaag gtccatgagt ccggagagta ttggtgccaa 300 

gccgacagct tacttccgag catgcacgtg aacgtagagt tttctgaaga ttttctggtg 360

ctgcaagctc cacctgctgt gtttgaagga gactctgtgg ttctgaggtg ctacgcaaag 420 

aaaggcatag aagcagagac cctgacattt tacaaggatg gtaaagctct gacattacat 480 

catcaaagtg agctctctat tcatcatgca aatctgaagg acaacggtca atacaaatgc 540

acttcgaaga agaagtggtc ttttgggtcc ctctatactt ccaatacggt cggagttcaa 600 

gtccaagagt tgttcccacg gcctgtgctg agagccagac cctcccatcc catagatgga 660 

agtccagtga ccctgacgtg tcagacccag ctctctgcac agaagtcaga tgcccggctc 720 

cagttctgtt tcttcagaaa cctccagctt ctggggtcag gctgcagccg ctcctcagag 780 

tttcacattc ctgccatatg gactgaagag tcaaggagat accagtgcaa ggcagaaaca 840 

gtgaattccc aagttagaaa acaaagtaca gcgttcataa tcccagtgca gagagcttct 900 

gcgagattcc aaacacacat catcccagcc tcaaagttgg tgtttgaagg gcagttgctg 960 

ttactcaact gctcagtaaa aggagtycca ggrcccctca aattctcctg gtataaaaag 1020 

gacatgctga atgaagaaac aaagattctt aagtcctcca acgcagaatt caagatctcc 1080 

caggtgaaca tcagtgacgc aggggagtat cactgtgaag ctaccaacag ccgccgaagc 1140

tttgtcagca gggcatttcc catcaccata aaagtcccag tatctcaacc agttctcacc 1200

ctaagcacag gcaagaccca ggcccttgag ggagacttga tgacacttca ttgtcaatcc 1260

cagaggggct ctccatgtat cctgtatgaa ttcttctatg agaatgtctc cctggggaat 1320

agctctatac tctctggagg aggagcatac ttcaatttct ctatgagcac agagcgatct 1380

ggaaactact actgcacagc agacaatggc ctgggagccc agtgcagtga agctataagg 1440

atctctatct ttgacatgac aaagaacaga agtgttccta tggctgccgg aatcactgtg 1500

ggactgctca tcatggctgt tggagtgttt ctgttttatt gctggttctc tagaaaagca 1560

ggaggaaagc ctacctctga tgactccaga aacccttcag attcagaacc ccaggagccc 1620

acctattaca acgtaccagc ctgtatagaa ctgcagccag tgtacagcaa tgagcctgag 1680

gaaaacgtga tttacacaga agtacggaga actcaaccaa gacagaaaca tgcagatcag 1740

gagtctgaaa gcccaagatc aaggtgccag atggctgaga aaaagtag 1788



<210> 99 
<211> 1710 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 99 
cagcatgaag ctgcacagca gtctgtggtt tcccttcagc ctccatggac cactttcttt 60
cgaggagagg tcgtcacact gacttgttat agattcggct tctccgtacc ccagaaaaca 120
aaatggtacc agaaaagaaa aacagtgaag caaaccccag gtgctttggt aattaaagca 180
cataccttaa aggtccatga gtccggagag tattggtgcc aagccgacag cttacttccg 240
agcatgcacg tgaacgtaga gttttctgaa gattttctgg tgctgcaagc tccacctgct 300
gtgtttgaag gagactctgt ggttctgagg tgctacgcaa agaaaggcat agaagcagag 360
accctgacat tttacaagga tggtaaagct ctgacattac atcatcaaag tgagctctct 420
attcatcatg caaatctgaa ggacaacggt caatacaaat gcacttcgaa gaagaagtgg 480
tcttttgggt ccctctatac ttccaatacg gtcggagttc aagtccaaga gttgttccca 540
cggcctgtgc tgagagccag accctcccat cccatagatg gaagtccagt gaccctgacg 600
tgtcagaccc agctctctgc acagaagtca gatgcccggc tccagttctg tttcttcaga 660
aacctccagc ttctggggtc aggctgcagc cgctcctcag agtttcacat tcctgccata 720
tggactgaag agtcaaggag ataccagtgc aaggcagaaa cagtgaattc ccaagttaga 780
aaacaaagta cagcgttcat aatcccagtg cagagagctt ctgcgagatt ccaaacacac 840
atcatcccag cctcaaagtt ggtgtttgaa gggcagttgc tgttactcaa ctgctcagta 900
aaaggagtyc caggrcccct caaattctcc tggtataaaa aggacatgct gaatgaagaa 960
acaaagattc ttaagtcctc caacgcagaa ttcaagatct cccaggtgaa catcagtgac 1020
gcaggggagt atcactgtga agctaccaac agccgccgaa gctttgtcag cagggcattt 1080
cccatcacca taaaagtccc agtatctcaa ccagttctca ccctaagcac aggcaagacc 1140
caggcccttg agggagactt gatgacactt cattgtcaat cccagagggg ctctccatgt 1200
atcctgtatg aattcttcta tgagaatgtc tccctgggga atagctctat actctctgga 1260
ggaggagcat acttcaattt ctctatgagc acagagcgat ctggaaacta ctactgcaca 1320
gcagacaatg gcctgggagc ccagtgcagt gaagctataa ggatctctat ctttgacatg 1380
acaaagaaca gaagtgttcc tatggctgcc ggaatcactg tgggactgct catcatggct 1440
gttggagtgt ttctgtttta ttgctggttc tctagaaaag caggaggaaa gcctacctct 1500
gatgactcca gaaacccttc agattcagaa ccccaggagc ccacctatta caacgtacca 1560
gcctgtatag aactgcagcc agtgtacagc aatgagcctg aggaaaacgt gatttacaca 1620
gaagtacgga gaactcaacc aagacagaaa catgcagatc aggagtctga aagcccaaga 1680
tcaaggtgcc agatggctga gaaaaagtag 1710

<210> 100
<211> 1401 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 100

cagcatgaag ctgcacagca gtctgtggtt tcccttcagc ctccatggac cactttcttt 60
cgaggagagg tcgtcacact gacttgttat agattcggct tctccgtacc ccagaaaaca 120
aaatggtacc agaaaagaaa aacagtgaag caaaccccag gtgctttggt aattaaagca 180
cataccttaa aggtccatga gtccggagag tattggtgcc aagccgacag cttacttccg 240
agcatgcacg tgaacgtaga gttttctgaa gattttctgg tgctgcaagc tccacctgct 300
gtgtttgaag gagactctgt ggttctgagg tgctacgcaa agaaaggcat agaagcagag 360
accctgacat tttacaagga tggtaaagct ctgacattac atcatcaaag tgagctctct 420
attcatcatg caaatctgaa ggacaacggt caatacaaat gcacttcgaa gaagaagtgg 480
tcttttgggt ccctctatac ttccaatacg gtcggagttc aagtccaaga gttgttccca 540
cggcctgtgc tgagagccag accctcccat cccatagatg gaagtccagt gaccctgacg 600
tgtcagaccc agctctctgc acagaagtca gatgcccggc tccagttctg tttcttcaga 660
aacctccagc ttctggggtc aggctgcagc cgctcctcag agtttcacat tcctgccata 720
tggactgaag agtcaaggag ataccagtgc aaggcagaaa cagtgaattc ccaagttaga 780
aaacaaagta cagcgttcat aatcccagtg cagagagctt ctgcgagatt ccaaacacac 840
atcatcccag cctcaaagtt ggtgtttgaa gggcagttgc tgttactcaa ctgctcagta 900
aaaggagtyc caggrcccct caaattctcc tggtataaaa aggacatgct gaatgaagaa 960
acaaagattc ttaagtcctc caacgcagaa ttcaagatct cccaggtgaa catcagtgac 1020
gcaggggagt atcactgtga agctaccaac agccgccgaa gctttgtcag cagggcattt 1080
cccatcacca taaaagtccc agtatctcaa ccagttctca ccctaagcac aggcaagacc 1140
caggcccttg agggagactt gatgacactt cattgtcaat cccagagggg ctctccatgt 1200
atcctgtatg aattcttcta tgagaatgtc tccctgggga atagctctat actctctgga 1260
ggaggagcat acttcaattt ctctatgagc acagagcgat ctggaaacta ctactgcaca 1320
gcagacaatg gcctgggagc ccagtgcagt gaagctataa ggatctctat ctttgacatg 1380
acaaagaaca gaagtgttcc t 1401


<210> 101 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 101 

atgtctggtt cattctcacc ctgtgtggtg ttcacacaga tgtggctgac tctactggtt 60
gtgactcctg tcaatggaca gcatgaagct gcacagcagt ctgtggtttc ccttcagcct 120
ccatggacca ctttctttcg aggagaggtc gtcacactga cttgttatag attcggcttc 180
tccgtacccc agaaaacaaa atggtaccag aaaagaaaaa cagtgaagca aaccccaggt 240
gctttggtaa ttaaagcaca taccttaaag gtccatgagt ccggagagta ttggtgccaa 300
gccgacagct tacttccgag catgcacgtg aacgtagagt tttctgaaga ttttctggtg 360
ctgcaagctc cacctgctgt gtttgaagga gactctgtgg ttctgaggtg ctacgcaaag 420
aaaggcatag aagcagagac cctgacattt tacaaggatg gtaaagctct gacattacat 480
catcaaagtg agctctctat tcatcatgca aatctgaagg acaacggtca atacaaatgc 540
acttcgaaga agaagtggtc ttttgggtcc ctctatactt ccaatacggt cggagttcaa 600
gtccaagagt tgttcccacg gcctgtgctg agagccagac cctcccatcc catagatgga 660
agtccagtga ccctgacgtg tcagacccag ctctctgcac agaagtcaga tgcccggctc 720
cagttctgtt tcttcagaaa cctccagctt ctggggtcag gctgcagccg ctcctcagag 780
tttcacattc ctgccatatg gactgaagag tcaaggagat accagtgcaa ggcagaaaca 840
gtgaattccc aagttagaaa acaaagtaca gcgttcataa tcccagtgca gagagcttct 900
gcgagattcc aaacacacat catcccagcc tcaaagttgg tgtttgaagg gcagttgctg 960
ttactcaact gctcagtaaa aggagtycca ggrcccctca aattctcctg gtataaaaag 1020
gacatgctga atgaagaaac aaagattctt aagtcctcca acgcagaatt caagatctcc 1080
caggtgaaca tcagtgacgc aggggagtat cactgtgaag ctaccaacag ccgccgaagc 1140
tttgtcagca gggcatttcc catcaccata aaagtcccag tatctcaacc agttctcacc 1200
ctaagcacag gcaagaccca ggcccttgag ggagacttga tgacacttca ttgtcaatcc 1260
cagaggggct ctccatgtat cctgtatgaa ttcttctatg agaatgtctc cctggggaat 1320
agctctatac tctctggagg aggagcatac ttcaatttct ctatgagcac agagcgatct 1380
ggaaactact actgcacagc agacaatggc ctgggagccc agtgcagtga agctataagg 1440
atctctatct ttgacatgac aaagaacaga agtgttcct 1479

<210> 102
<211> 240 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: /note =
synthetic construct

<400> 102

tctagaaaag caggaggaaa gcctacctct gatgactcca gaaacccttc agattcagaa 60
ccccaggagc ccacctatta caacgtacca gcctgtatag aactgcagcc agtgtacagc 120
aatgagcctg aggaaaacgt gatttacaca gaagtacgga gaactcaacc aagacagaaa 180
catgcagatc aggagtctga aagcccaaga tcaaggtgcc agatggctga gaaaaagtag 240